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Lyme Disease Vaccines 



Field of, the Invention 

The present invention relates to novel vaccines for the prevention or attenuation of Lyme 
disease. The invention further relates to isolated nucleic acid molecules encoding antigenic " 
10 polypeptides of Borrelia burgdorferi. Antigenic polypeptides are also provided, as are vectors, 
host cells and recombinant methods for producing the same. The invention additionally relates to 
diagnostic methods for detecting Borrelia gene expression. 

Background of the Invention 

Lyme disease (Steere, A.C., Proc. Natl. Acad. Sci. USA 97:2378-2383 (1991)), or Lyme 
borreliosis, is presently the most common human disease in the United States transmitted by an 
arthropod vector (Center for Disease Control, Morbid. Mortal. Weekly Rep. 46(23 ):53 1-535 
(1997)). Further, infection of house-hold pets, such as dogs, is a considerable problem. 

While initial symptoms often include a rash at the infection point, Lyme disease is a 
multisystemic disorder that may include arthritic, carditic, and neurological manifestations. While' 
antibiotics are currentiy used to treat active cases of Lyme disease, B. burgdorferi persists even 
after prolonged antibiotic treatment. Further, B. burgdorferi can persist for years in a mammalian 
25 host in the presence of an active immune response (Straubinger, R. et aL, J. Clin. Microbiol. 
\ 55:111-116 (1997); Steere, A., N. Engl J, Med. 327:586-596 (1989)). 

Lyme disease is caused by the related tick-borne spirochetes classified as Borrelia 
burgdorferi sensu lato (including B. burgdorferi sensu stricto, B. dfzelii. B. garinii). Although 
substantial progress has been made in the biochemical, ultrastructural, and genetic characterization 
30 of the organism, the spirochetal factors responsible for infectivity, immune evasion and disease 
pathogenesis remain largely obscure. 

• A number of antigenic B. burgdorferi cell surface proteins have been identified. These 
include the outer membrane surface proteins (Osp) OspA, OspB, OspC and OspD. OspA and 
OspB are encoded by tightiy linked tandem genes which are transcribed as a single transcriptional 
35 unit (Brusca, J. et aL, J. BacterioL 773:8004-8008 (1991)). The most-studied B. burgdorferi 
membrane protein is OspA, a lipoprotein antigen expressed by borreliae in resting ticks and the 
most abundant protein expressed in vitro by most borrelial isolates (Barbour, A.G., et al. t . 
Infection & Immunity 4 1:1 95 -804 (1983); Howe, T.R., et aL, Science 227:645 (1985)). 



A number of different types of Lyme disease vaccines have been shown to induce 
immunological responses. Whole-cell B. burgdorferi vaccines, for example, have been shown to 
induce both immunological responses and protective immunity in several animal models. . 
(Reviewed in Wormser, G., Clin. Infect. Dis. 27il267-1274 (1995)). Further, passive immunity 
has been demonstrated in both humans and other animals using B. burgdorferi specific antisera. 

While whole-cell Lyme disease vaccines confer protective immunity in animal models, use 
of such vaccines presents the risk that responsive antibodies will produce an autoimmune 
response (Reviewed in Wormser, G., supra). This problem is at least partly the result of the • 
production of B. burgdorferi specific antibodies which cross-react with hepatocytes and both 
muscle and nerve cells. B. burgdorferi heat shock proteins and the 41-kd flagellin subunit are 
-believed to contain antigens which elicit production of these cross-reactive antibodies. 

Single protein subunit vaccines for Lyme disease have also been- tested, the cell surface 
proteins of B. burgdorferi are potential candidates for use in such vaccines and several have been 
shown to elicit protective immune responses in mammals (Probert, W. etaP Vaccine 75:15-19 - 
(1997); Fikrig, E. et al, Infect. Immun. (53:1658-1662 (1995); Langerman S. et al. Nature 
372:552-556 (1994); Fikrig, E. et al, J. Immunol. 148:2256-2260 (1992)). Experimental OspA 
vaccines, for example, have demonstrated efficacy in several animal models (Fikrig, E., et al., 
Proc. Natl. Acad. ScL USA 59:5418-5421 (1992); Johnson, B.J., et al, Vaccine 73:1086-1094 
(1996); Fikrig, E., et al, Infect. Immun. 60:657-661 (1992); Chang, Y.F., et al, Infection & 
Immunity 63:3543-3549 (1995)), and OspA vaccines for human use are under clinical evaluation - 
(Keller, D., et al, J. Am. Med. Assoc. 277:1764-1768 (1994); Van Hoecke, C, et al, Vaccine 
14: 1620-1626 (1996)). Passive immunity is also conferred by antisera containing antibodies 
specific for the full-length OspA protein, Further, vaccination with plasmid DNA encoding OspA 
has been demonstrated to elicit protective immune responses in mice (Luke, C. et al, J. Infect. 
Dis. 775:91-97 (1997); Zhong,~W. et al, Eur. J. Immunol. 26:2749-2757 (1996)). 

- Recent immunofluorescence assay observations indicate that during tick engorgement- the 
expression of OspA by borreliae diminishes (deSilva, A.M., et al, J. Exp. Med. 733:271-275 
(1996)) while expression of other proteins, exemplified by OspC, increases (Schwan, T.G., et 
al, Proc. Natl. Acad. Sci. USA 92:2909-2913 (1985)). By the time of transmission to hosts, 
spirochetes in the tick salivary glands express little or no OspA. This down-modulation of OspA 
appears to explain the difficulties in demonstrating immune responses to this antigen early m . 
infection following tick bites (Kalish/R.A., et al, Infect. Immun, 63:2228-2235 (1995); Gern, 
L., et al, J. Infect. Dis. 767:971-975 (1993); Schiable, U.E., et al, Immunol. Lett. 36:219-226 
(1993)) or following challenge with limiting doses of cultured borreliae (Schiable, U.E., et al, 
Immunol Lett. 36:219-226 (1993); Barthold, S.W. and Bockenstedt, L.K., Infect. Immun. 

67^4696-4702 (1993)). . 

Furthermore, OspA-specific antibodies are ineffective if administered after a borrelial 
challenge delivered by svringe (Schiable, U.E., et al, Proc. Natl Acad. Sci. USA S7:3768-3772 
(1990)) or tick bite (deSilva, A.M., et al, J. Exp. Med. 183:21 1-275 (1996)). To be efficacious, 
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OspA vaccines must elicit protective levels of antibody which are maintained throughout periods 
of tick exposure in order to block borrelia transmission from the arthropod vector. 

Vaccines in current use against other pathogens include in vivo-expressed antigens which 
could boost anamnestic responses upon infection, potentiate the action of immune effector cells 
and'complement, and inhibit key virulence mechanisms. OspC is both expressed during infection 
(Montgomery, R.R., et al., J.Exp. Med. 183:261-269 (1996)) and a target for protective- 
immunity (Gilmore, R.D., et al., Infect. Immun. 54:2234-2239 (1996); Probert, W.S. and 
LeFebvre, R.B., Infect. Immun. 62: 1920-1926 (1994); Preac-Mursic, V., et al., Infection 
20:342-349 (1992)), but mice immunized with this protein were only protected against challenge 
with the homologous borrelial isolate (Probert, W.S., et al., J. Infect. Dis. 775:400-405 (1997)). 
Identification of in vivp-expressed, and broadly protective, antigens of B. burgdorferi has 
remained elusive. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising polynucleotides 
. encoding the B. burgdorferi peptides having the amino acid sequences shown in Table 1. Thus, . 
one aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides 
having a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1; (b) a 
nucleotide sequence encoding any of the amino acid sequences of the full-length polypeptides . 
shown in Table 1 but minus the N-terminal methionine residue, if present; (c) a nucleotide 
• sequence encoding any of the amino acid sequences of the truncated polypeptides shown in Table 
1; and (d) a nucleotide sequence complementary to any of the nucleotide sequences in (a), (b), or 

(c) above. t ' 

Further embodiments of the invention include isolated nucleic acid molecules that 
comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more 
preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in 
(a), (b), (c), or (d) above, or a polynucleotide which hybridizes under stringent hybridization 
conditions to a polynucleotide in (a), (b), (c), or (d) above. This polynucleotide which hybridizes 
does not hybridize under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. Additional nucleic acid 
embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides 
which encode the amino acid sequences- of epitope-bearing portions of a B. burgdorferi 
polypeptide having an amino acid sequence in (a), (b), or (c) above. 

The present invention also relates to recombinant vectors, which include the isolated 
nucleic acid molecules of the present invention, and to host cells containing the recombinant 
■ vectors, as well as to methods of making such vectors and host cells and for using these vectors 
for the production of B. burgdorferi polypeptides or peptides by recombinant techniques. 

The invention further provides isolated B burgdorferi polypeptides having an amino acid 
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sequence selected from the group consisting of: (a) an amino acid sequence of any of the full- 
length polypeptides shown in Table 1 ; (b) an amino acid sequence of any of the full-length 
polypeptides shown in Table 1 but minus the N-terminal methionine residue, if present; (c) an 
amino acid sequence of any of the truncated polypeptides shown in Table 1; and (d) an amino acid 
sequence of an epitope-bearing portion of any one of the polypeptides of (a), (b), or (c). 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence with at least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% similarity to those described in (a), (b), (c), or (d) above, as well as 
polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% 
identical, and' still more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 
those above; as well as isolated nucleic acid molecules encoding such polypeptides. 

The present invention further provides a vaccine, preferably a multi-component vaccine 
comprising one or more of the B. burgdorferi polypeptides shown in Table 1 , or fragments 
thereof, together with a pharmaceutical^ acceptable diluent, carrier, or excipient, wherein the 
B. burgdorferi polypeptide(s) are present in an amount effective to elicit an immune response to 
members of the Borrelia genus in an animal. . The B. burgdorferi polypeptides of the present 
invention may further be combined with one or more immunogens of one or more other borrelial 
or non-borrelial organisms to produce a multi-component vaccine intended to elicit an 
immunological response against members of the Borrelia genus and, optionally, one or more non- 
borrelial organisms. - 

The vaccines of the present invention can be administered in a DNA form, e.g., "naked" 
DNA, wherein the DNA encodes one or more borrelial polypeptides and, optionally, one or more 
polypeptides of a non-borrelial organism. The DNA encoding one or more polypeptides may be 
constructed such that these polypeptides are expressed fusion proteins. 

The vaccines of the present invention may also be administered as a component of a 
genetically engineered organism. Thus, a genetically engineered organism which expresses one 
or more B. burgdorferi polypeptides may be administered to an animal. For example, such a 
genetically engineered organism may contain one or more B. burgdorferi polypeptides of the 
present invention intracellular^, on its cell surface, or in its periplasmic space. Further, such a 
genetically engineered organism may secrete one or more B. burgdorferi polypeptides. 

The vaccines of the present invention may be co-administered to an animal with an 
immune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response in an animal 
to one or more members of the Borrelia genus, e.g., B. burgdorferi sensu stricto, B.afzelii, and 
B. garinii, comprising administering to the animal a vaccine as described above. 

The invention further provides a method of inducing a protective immune response in an 
animal, sufficient to prevent or attenuate an infection by members of the Borrelia genus, 
comprising administering to the animal a composition comprising one or more of the polypeptides 
shown in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may 



be conjugated to another immunogen and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies elicited in an animal by the administration of one 
or more B. burgdorferi polypeptides of the present invention. 

' The invention also provides diagnostic methods for detecting the expression of genes of 
members of the Borrelia genus in an animal. One such method involves assaying for the 
expression of a gene encoding Borrelia peptides in a sample from an. animal. This expression 
may be assayed either directly {e.g., by assaying polypeptide levels using antibodies elicited in 
response to amino acid sequences shown in Table 1) or indirectly {e.g., by assaying for 
antibodies having specificity for amino acid sequences shown in Table 1). An example of such a 
method involves the use of the polymerase chain reaction (PCR) to amplify and detect Borrelia 

nucleic acid sequences. ' 

The present invention also relates to nucleic acid probes having all or part of a nucleotide 
sequence shown in Table 1 which are capable of hybridizing under stringent conditions to 
Borrelia nucleic acids. The invention further relates to a method of detecting one or more Borrelia 
nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids, 
encoding Borrelia polypeptides, comprising: 

.' . a) contacting the sample with one or more of the above-described nucleic acid probes, 
under conditions such that hybridization occurs, and 

b) detecting hybridization of said one or more probes to the Borrelia nucleic acid present 
in the biological sample. 

Detailed Description 

The present invention relates to recombinant antigenic B. burgdorferi polypeptides and 
fragments thereof. The invention also relates to methods' for using these polypeptides to produce 
immunological responses and to confer immunological protection to disease caused by members 
of the genus Borrelia. The invention further relates to nucleic acid sequences which encode 
antigenic B: burgdorferi polypeptides and to methods for detecting Borrelia nucleic acids and 
polypeptides in biological samples. The invention also relates to Borrelia specific antibodies and 
methods for detecting such antibodies produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter which the inventors 
consider to be the present invention. 

As used herein, the phrase "pathogenic agent" means an agent which causes a disease state 
or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, 
fungi, viruses and metazoan parasites which either produce a disease state or render an animal 
infected with such an organism susceptible to a disease state {e.g., a secondary infection). 
Further included are species and strains of the genus Borrelia which produce disease states in 
animals. 
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As used herein, the term "organism" means any living biological system, including 
viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Borrelia" means any species or strain of bacteria which is 
members of the genus Borrelia. Included within this definition are Borrelia burgdorferi sensu lato 
(including B. burgdorferi sensu stricto, B. afzelii, B. garinii), B. andersonii, B. anserine, B. 
japonica, B. coriaceae, and other members of the genus Borrelia regardless of whether they are 
known pathogenic agents. 

As used herein, the phrase "one or more B. burgdorferi polypeptides of the present 
invention" means the amino acid sequence of one or more of the B. burgdorferi polypeptides 
disclosed in Table 1 . These polypeptides may be expressed as fusion proteins wherein the 
B. burgdorferi polypeptides of the present invention are linked to additional amino acid 
sequences which may be of borrelial or non-borrelial origin. This phrase further includes 
fragments of the B. burgdorferi polypeptides of the present invention. 

As used herein, the phrase "full-length amino acid sequence" and "full-length polypeptide" 
refer to an amino acid sequence or polypeptide encoded by a full-length open reading frame 
(ORF). An ORF may be defined as a nucleotide sequence bounded by stop codons which 
encodes a putative polypeptide. An ORF may also be defined as a nucleotide sequence within a 
stop codon bounded sequence which contains an initiation codon (e.g., a methionine or valine 
codon) on the 5' end and a stop codon on the 3' end. 

As used herein, the phrase "truncated amino acid sequence" and "truncated polypeptide" 
refer to a sub-sequence of a full-length amino acid sequence or polypeptide. Several criteria may 
also be used to define the truncated amino acid sequence or polypeptide. For example, a truncated 
polypeptide may -be defined as a mature polypeptide (e.g., a polypeptide which lacks a leader 
sequence). A truncated polypeptide may also be defined, as an amino acid sequence which is a 
portion of a longer sequence that has been selected for ease of expression in a heterologous 

system but retains regions which render the polypeptide useful for use in vaccines (e.g., antigenic 
regions which are expected to elicit a protective immune response). 

Additional definitions are provided throughout the specification. 
Explanation of Table 1 

Table 1 lists B. burgdorferi nucleotide and amino acid sequences of the present invention. 

The nomenclature used therein is as follows: 

"nt" refers to nucleotide sequences; 

"aa" refers to amino acid sequences; . 

"f refers to full-length nucleotide or amino acid sequences; and 

"t" refers to truncated nucleotide or amino acid sequences. 
. Thus, for example, the designation "f 10 1 .aa" refers to the full-length amino acid sequence 
of B. burgdorferi polypeptide number .101. Further, "flOl.nt" refers to the full-length nucleotide 
sequence encoding the full-length amino acid sequence of B. burgdorferi polypeptide number 
101. . 
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Explanation of Table 2 

■' Table 2 lists accession numbers for the closest matching sequences between the 
polypeptides of the present invention and those available through GenBank and GeneSeq 
5 databases. These reference numbers are the database entry numbers commonly used by those of 
skill in the art, who will be familar with their denominations. The descriptions of the 
numenclature for GenBank are available from the National Center for Biotechnology Information. 
Column 1 lists the gene or ORF of the present invention. Column 2 lists the accession number of 
a "match" gene sequence in GenBank or GeneSeq databases. Column 3 lists the description of 
10 the "match" gene sequence. Columns 4 and 5 are the high score and smallest sum probability, 
respectively, calculated by BLAST. Polypeptides of the present invention that do not share 
m significant identity/similarity with any polypeptide sequences of GenBank and GeneSeq are not 
|j represented in Table 2. Polypeptides of the present invention that share significant . 
y identity/similarity with more than one of the polypeptides of GenBank and GeneSeq are 
;rf5 . represented more 1 than once. 

y ■ " ■ - 

^ Explanation of Table 3. 

p.| The B. burgdorferi polypeptides of the present invention may include one or more conservative 
3 amino acid substitutions from natural mutations or human manipulation as indicated in Table 3 . 
Mo Changes are preferably of a minor nature, such as conservative amino acid substitutions that do 

(| . not significantly affect the folding or activity of the protein. Residues from the following groups, 
f- k as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, 
Acidic, and Small, 

25 Explanation of Table ~4 ' 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing fragments 
present in each of the full length B. burgdorferi polypeptides described in Table 1 as predicted by 
the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. 
The Jameson-Wolf antigenic analysis was performed using the computer program PROTEAN 

30 (Version 3.1 1 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, 
WI). B. burgdorferi polypeptide shown in Table 1 may one or more antigenic epitopes 
comprising residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant may vary 
slightly. The residues and locations shown described in Table 4 correspond to the amino acid 

35 sequences for each full length gene sequence shown in Table 1 and in the Sequence Listing. 
. Polypeptides of the present invention that do not have antigenic epitopes recognized by the 
Jameson-Wolf algorithm are not represented in Table 2.. 
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Selection of Nucleic Acid Sequences Encoding Antigenic B. burgdorferi 
Polypeptides 

The present invention provides a select number of ORFs from those presented in the 
fragments of the Borrelia burgdorferi genome which may prove useful for the generation of a 
protective immune response. The sequenced B. burgdorferi genomic DN A was obtained frorn a 
sub-cultured isolate of ATCC Deposit No. 35210. The sub-cultured isolate was deposited on. 
August 8, 1997 at the American Type Culture Collection, 12301 Park Lawn Drive, Rockville, 
Maryland 20852^ and given accession number 202012. 

Some ORFs contained in the subset of fragments of the B. burgdorferi genome disclosed 
herein were derived through the use of a number of screening criteria'detailed below. The ORFs 
are generally bounded at the amino terminus by a methionine residue and at the carboxy terminus, 
by a stop codon. 

Many of the selected sequences do not consist of complete ORFs. Although a polypeptide 
representing a complete ORF may be the closest approximation of a protein native to an organism, 
- it is not always preferred to express a complete ORF in a heterologous system. It may be 
challenging to express and purify a highly hydrophobic protein by common laboratory methods. 
Some of the polypeptide vaccine candidates described herein have been modified slighdy to 
simplify the production of recombinant protein. For example, nucleotide sequences which encode 
highly hydrophobic domains, such as those found at the amino terminal signal sequence, have 
been excluded from some constructs used for in vitro expression of the polypeptides . 
Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus - 
have also been excluded from the recombinant expression constructs. Thus, in one embodiment, 
a polypeptide which represents a truncated or modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially immunogenic 
polypeptides, many of the ORFs disclosed herein were selected on the basis of screening all 
theoretical Borrelia burgdorferi ORFs for several aspects of potential immunogenicity. One set of 

selection criteria are as follows: 

1. Type I signal sequence: An amino terminal type I signal sequence generally directs a. 
nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. 
Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I 
signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and 
Kendall, D. A. Mol. Microbiol, 13:1 '65-773 (1994)). The length of the type I signal sequence is 
approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in 
the extreme amino terminus. In addition, the central region of the signal sequence adopts an 
alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the 
actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and 
-3 positions. 

2. Type TV signal sequence: The type IV signal sequence is an example of the several 
types of functional signal sequences which exist in addition to the type I signal sequence detailed 
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above. Although functionally related, the type IV signal sequence possesses a unique set of 
biochemical and physical attributes (Strom, M. S. and Lory, S., J. Bacteriol. 774:7345-7351 
(1992)). These are typically six to eight amino acids with a net basic charge followed by an 
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal 
sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In 
addition, type TV signal sequences generally contain a phenylalanine residue at the +1 site relative 
to the cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors 
has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly 
three-fourths of the bacterial lipoprotein precursors examined contained the sequence L-(A,S)- 
(G.A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu, H. C, J. 
Bioenerg. Biomembr. 22:451-471 (1990)). . 

4. LPXTG motif: It has been experimentally determined that most anchored proteins 
found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal 
sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, B. 
burgdorferi„S. pneumoniae, and others, have been identified based on their extracellular location 
and carboxy terminal amino acid sequence (Fischetti, V. A., ASM News 62:405-410 (1996)). 
The conserved region consists of six charged amino acids at the extreme carboxy terminus 
coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. 
Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in 
nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X 
is any amino acid. " 

An algorithm for selecting antigenic and immunogenic Borrelia burgdorferi polypeptides 
including the foregoing criteria was developed. The algorithm is similar to that described in U.S. 
patent application 08/781,986, filed January 3, 1997, which is fully incorporated by reference 
herein,. Use of the algorithm by the inventors to select immunologically useful Borrelia 
burgdorferi polypeptides resulted in the selection of a number of the disclosed ORFs. 
Polypeptides comprising the polypeptides identified in this group may be produced by techniques 
standard in the art and as further described herein. 

Nucleic Acid Molecules 

The present invention provides isolated nucleic acid molecules comprising polynucleotides 
encoding the B. burgdorferi polypeptides having the amino acid sequences shown in Table 1, 
which were determined by sequencing the genome of B. burgdorferi deposited as ATCC deposit 
no. 202012 and selected as putative immunogens. 

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA 
'molecule herein were determined using an automated DNA sequencer (such as the Model 373 
from Applied Biosystems, Inc.), and ail amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of DNA sequences determined as 
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above. Therefore, as is known in the art for any DNA sequence determined by this automated 
approach, any nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more typically at 
least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the 
sequenced DNA molecule. The actual sequence can be more precisely determined by other, 
approaches including manual DNA sequencing methods well known in the art. As is also known 
in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted 
amino acid sequence encoded by a determined nucleotide sequence will be completely different 
from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the 
point of such an insertion or deletion. 

Unless otherwise indicated, each "nucleotide sequence" set forth herein is presented as a 
sequence of deoxyribonucleotides (abbreviated A, G , C and T). However, by "nucleotide 
sequence" of a nucleic acid molecule or polynucleotide is intended, for a DNA molecule or 
polynucleotide, a sequence of deoxyribonucleotides; and for an RNA molecule or polynucleotide, 
the corresponding sequence of ribonucleotides (A, G, C and U), where each thymidine 
deoxyribonucleotide (T) in the specified deoxyribonucleotide sequence is replaced by the 
ribonucleotide uridine (U). For instance, reference to an RNA molecule having a sequence of 
Table 1 set forth using deoxyribonucleotide abbreviations is intended to indicate an RNA molecule 
having a sequence in which each deoxyribonucleotide A, G or C of Table 1 has been replaced by 
the corresponding ribonucleotide A, G or C, and each deoxyribonucleotide T has been replaced 
by a ribonucleotide U. 

Nucleic acid molecules of the present invention may be^gt the form of RNA, such as 
mRNA, or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by 
cloning or produced synthetically. The DNA may be double-stranded or single-stranded. 
Single-stranded DNA or RNA may be .the coding strand, also known as the sense strand, or it 
may be the non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, 
which has been removed from its native environment. For example, recombinant DNA molecules 
contained in a vector are considered isolated for the purposes of the present invention. Further 
examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present 
invention. Isolated nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 

In addition, isolated nucleic acid molecules of the invention include DNA molecules which 
comprise a sequence substantially different from those described above but which, due to the 
. degeneracy of the genetic code, still encode a S. burgdorferi polypeptides and peptides of the 
present invention (e.g. polypeptides of Table 1). That is, all possible DNA sequences that encode 



the B. burgdorferi polypeptides of the present invention. This includes the genetic code and 
species-specific codon preferences known in the art. Thus, it would be routine for one skilled in 
the art to generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those preferred by a 
mammalian or other bacterial host such as E. coli). 

The invention further provides isolated nucleic acid molecules having the nucleotide 
sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of 
the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes 
for gene mapping and for identifying B. burgdorferi in a biological sample, for instance, by PCR, 
Southern blot, Northern blot, or other form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding portions or 
fragments of the nucleotide sequences described herein. Fragments include portions of the . 
nucleotide sequences of Table 1 at least 10 contiguous nucleotides in length selected from any two 
integers, one of which representing a 5' nucleotide position and a second/of which representing a 
3* nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is 
position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment at least 
10 contiguous nucleotides in length could occupy is included in the invention. "At least" means a 
fragment may be 10 contiguous nucleotide bases in length or any integer between 10 and the 
length of an entire nucleotide sequence of Table 1 minus 1. Therefore, included in the invention 
are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide 
sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of 
an entire nucleotide sequence minus 1. 

Further, the invention includes polynucleotides comprising fragments specified by size, in 
■nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in 
condguous nucleotides, selected from integers between 10 and the length of an entire nucleotide 
sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 
nucleotides, 40 nucleotides, 50 nucleotides. Other preferred sizes of contiguous nucleotide 
fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 
nucleotides in length which include, as discussed above, fragment sizes representing each integer 
between 50-300. Larger fragments are also useful according to the present invention 
corresponding to most, if not all, of the nucleotide sequences shown in Table lor of the B. 
burgdorferi nucleotide sequences of the plasimd clones listed in Table 1. The preferred sizes are, 
of course, meant to exemplify" not limit the present invention as all size fragments, representing 
any integer between 10 and the length of an entire nucleotide sequence minus 1, are included in 
the invention. Additional preferred nucleic acid fragments of the present invention include 
nucleic acid molecules encoding epitope-bearing portions of B. burgdorferi polypeptides 
identified in Table 4. 

The present invention also provides for the exclusion of any fragment, specified by 5' and 
3' base positions or by size in nucleotide bases as described above for any nucleotide sequence of 
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Table 1 or the plasimd clones listed in Table 1. Any number of fragments of nucleotide sequences 
in Table 1 or the plasimd clones listed in Table 1, specified by 5' and 3' base positions or by size 
in nucleotides, as described above, may be excluded from the present invention. 

Preferred nucleic acid fragments of the present invention also include nucleic acid 
molecules encoding epitope -bearing portions of the B. burgdorferi polypeptides shown in Table 
1. Such nucleic acid fragments of the present invention include, for example, nucleic acid 
molecules encoding polypeptide fragments comprising from about the amino terminal residue to . 
about the carboxy terminal residue of each fragment shown in Table 4. The above referred to 
polypeptide fragments are antigenic regions of particular B. burgdorferi polypeptides shown in 
Table 1 . Methods for determining other such epitope-bearing portions for the remaining . 
polypeptides described in Table 1 are well known in the art and are described in detail below. 

In another aspect, the invention provides isolated nucleic acid molecules comprising 
polynucleotides which hybridize under stringent hybridization conditions to a portion of a 
polynucleotide in a nucleic acid molecule of the invention described above, for instance, a nucleic 
acid sequence shown in Table 1. By "stringent hybridization conditions" is intended overnight 
incubation at 42 C in a solution comprising: 50% formamide, 5x SSC (150 mM NaCl, 15 mM 
. trisodium citrate), 50 mM sodium phosphate (pH 7,6), 5x Denhardt's solution, 10% dextran 
sulfate, and 20 g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 
O.lx SSC at about 65 C. 

By polynucleotides which hybridize to a "portion" of a polynucleotide is intended 
polynucleotides (either DNA or RNA) which hybridize to at least about 15 nucleotides (nt), and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably about 30-70 nt of the reference polynucleotide. These are useful as diagnostic probes 
and primers as discussed above and in more detail below. _ - '. 

Of course, polynucleotides hybridizing to a larger portion of the reference polynucleotide, 
for instance, a portion 50-100 nt in length, or even to the entire length of the reference 
polynucleotide, are also useful as probes according to the present' invention, as are 
polynucleotides corresponding to most, if not all, of a nucleotide sequence as shown in Table 1 . 
By a portion of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or more 
contiguous nucleotides from the nucleotide sequence of the reference polynucleotide {e.g. , a 
nucleotide sequences as shown in Table 1). As noted above, such portions are useful 
diagnostically either as probes according to conventional DNA hybridization techniques or as 
primers for amplification of a target sequence by PCR, as described, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J., Fritsch, E. F. and Maniatis, T., 
eds., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989), the entire 
disclosure of which is hereby incorporated herein by reference. 

Since nucleic acid sequences encoding the B. burgdorferi polypeptides of the present 
invention are provided in Table 1, generating polynucleotides which hybridize to portions of these 
sequences would be routine to the skilled artisan. For example, the hybridizing polynucleotides 
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of the present invention could be generated synthetically according to known techniques. 

As indicated, nucleic acid molecules of the present invention which encode B. burgdorferi 
polypeptides- of the present invention may include, but are not limited to those encoding the amino 
acid sequences of the polypeptides by themselves; and additional coding sequences which code 
for additional amino acids, such as those which provide additional functionalities. Thus, the 
sequences encoding these polypeptides may be fused to a marker sequence, such as a sequence 
encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), among others, many of which 
are commercially available. As described in Gentz et al, Proc. Natl. Acad. Sci. USA 56:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the resulting fusion 
protein. 

Thus, the present invention also includes genetic fusions wherein the 
B. burgdorferi nucleic acid sequences coding sequences provided in Table 1 are linked to 
additional nucleic acid-sequences to produce fusion proteins. These fusion proteins may include 
epitopes of borrelial or non-borrelial origin designed to produce proteins having enhanced 
immunogenicity, Further, the fusion proteins of the present invention may contain antigenic - 
determinants known to provide helper T-cell stimulation, peptides encoding sites for 
post-translational modifications which enhance immunogenicity {e.g., acylation), peptides which 
facilitate purification (<?.g.,-histidine "tag"), or amino acid sequences which target the fusion 
protein to a desired location {e.g., a heterologous leader sequence). For instance, hexa-histidihe 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Nad. 
Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for purification which corresponds 
to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell . 
37:767. As discussed below, other such fusion proteins include the 5. burgdorferi polypeptides 
of the present invention fused to Fc at the N- or C-terminus. 

Post-translational modification of the full-length B. burgdorferi OspA protein 
expressed in E. coli is believed to increase the immunogenicity of this protein. Erdile, L: et al, 
Infect. Immun. 67:81-90 (1993). B. burgdorferi OspA when expressed in E. coli, for example, 
is post-translationally modified in at least two ways. First, a signal peptide is cleaved; second, 
lipid moieties are attached. The presence of these lipid moieties is believed to confer enhanced 
immunogenicity and results in the elicitation of a strong protective immunological response. 

Variant and Mutant Polynucleotides 

The present invention thus includes nucleic acid molecules and sequences which encode 
fusion proteins comprising one or more B. burgdorferi polypeptides of the present invention 
fused to an amino acid sequence which allows for post-translational modification to enhance 
immunogenicity. This post-translational modification may occur either in vitro or when the fusion 
protein is expressed in vivo in a host cell. An example of such a modification is the introduction 
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of an amino acid sequence which results in the attachment of a lipid moiety. Such a lipid moiety 
attachment site of OspA, which is lipidated upon expression in E. coli, has been identified. 
Bouchon, B. et aL, Anal. Biochem. 246:52-61 (1997). 

Thus, as indicated above, the present invention includes genetic fusions wherein a 
B. burgdorferi nucleic acid sequence provided in Table 1 is linked to a nucleotide sequence 
encoding another amino acid sequence. These other amino acid sequences may be of borrelial 
origin {e.g., another sequence selected from Table 1) or non-borrelial origin: An example of such 
a- fusion protein is reported in Fikrig, E. et aL, Sconce. 250:553-556 (1990) where an OspA- 
glutathione-S-transferase fusion protein was produced and shown to elicit protective immunity 
against Lyme disease in immune competent mice. 

The present invention further relates to variants of the nucleic acid molecules of the present 
invention, which encode portions, analogs or derivatives, of the B. burgdorferi polypeptides 
shown in Table 1. Variants may occur naturally, such as a natural allelic variant. By an "allelic - 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism. Genes 77, Lewin, B., ed., John Wiley & Sons, New York (1985). 
Non-naturally occurring variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions or additions. 
The substitutions, deletions or additions may involve one or more nucleotides. These variants 
may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions 
may produce conservative or non-conservative amino acid substitutions, deletions or additions. 
Especially preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties- and activities of the B. burgdorferi polypeptides disclosed herein or portions 
thereof. Also especially preferred in this regard are conservative substitutions. 

The present application is further directed to nucleic acid molecules at least 90%, 95%, 
96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1. The above 
nucleic acid sequences are included irrespective of whether they encode a polypeptide having B. 
burgdorferi activity. This is. because even -where a particular nucleic acid molecule does not , 
encode a polypeptide having B. burgdorferi activity, one of skill in the art would still know how 
to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid 
molecules of the present invention that do not encode a polypeptide having 5. burgdorferi activity 
include, inter alia, isolating an B. burgdorferi gene or allelic variants thereof from a DNA library, 
and detecting B. burgdorferi mRNA expression samples/environmental samples, suspected of • 
containing B. burgdorferi by Northern Blot analysis. , 

Embodiments of the invention include isolated nucleic acid molecules comprising a . 
polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 
95%, 96%, 97%, 98% or 99% identical to- (a) a nucleotide sequence encoding any of the amino- 
acid sequences of the full-length polypeptides shown in Table 1; (b) a nucleotide sequence 
encoding any of the amino acid sequences of the full-length polypeptides shown in Table 1 but 
minus the N-terminal methionine residue, if present; (c) a nucleotide sequence encoding any of the 
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amino acid sequences of the truncated polypeptides shown in Table 1 ; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b), or (c) above. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 
98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a 
polypeptide having 5. burgdorferi protein activity By "a polypeptide having B. burgdorferi 
activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an 
activity of the B. burgdorferi protein of the invention, as measured in a particular biological assay 
suitable for measuring activity of the specified protein. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 
recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 
96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table 1 will encode a 
polypeptide having B. burgdorferi protein activity. In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even 
without performing the above described comparison assay. It will be further recognized in the: art 
that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having B. burgdorferi protein activity. This is because the skilled 
artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly effect protein function (e.g., replacing one aliphatic amino acid with a second 
aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present invention are expected 
to be similar or identical to polypeptides from other bacteria that share a high degree of structural 
identity /similarity. Tables 2 lists accession numbers and descriptions for the closest matching 
sequences of polypeptides available through Genbank and Derwent databases. It is therefore 
expected that the biological activity or function of the polypeptides of the present invention will be 
similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in 
Table 2. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% "identical" to 
a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence 
of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence 
may include up to five point mutations per each 100 nucleotides of the reference nucleotide 
sequence encoding the B, burgdorferi polypeptide. In other words, to obtain a polynucleotide 
having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% 
(5 of 100) of the nucleotides in the reference sequence may be deleted, inserted, or substituted 
with another nucleotide. The query sequence may be an entire sequence shown in Table. 1, the 
ORF (open reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention 
can be determined conventionally using known computer programs. A preferred method for 
determining the best overall match between a query sequence (a sequence of the present invention) 
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and a subject sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et al. 
(1990) Comp. App. Biosci. 6:237-245. In a sequence .alignment the query and subject^sequences 
are both DNA sequences. An RNA sequence can be compared by first converting U's to Ts. 
The result of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB alignment of DNA sequences to calculate percent identity are: Ma'trix=Unitary, k- 
tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, 
not because of internal deletions, a manual correction must be made to the results. This is because 
. the FASTDB program does not account for 5' and 3' truncations of the subject sequence when 
calculating percent identity. .For subject sequences truncated at the 5' or 3' ends, relative to the 
query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5 J and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB. program using the specified parameters, to arrive at a 
final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only nucleotides outside the 5' and 3* nucleotides of the subject sequence, as 
displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90, nucleotide subject sequence is aligned to a 100 nucleotide query 
sequence to determine percent identity. The deletions occur at the 5' end of the subject sequence 
and therefore, the FASTDB alignment does not show a 'matched/alignment of the first 10 
nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the sequence (number of 
nucleotides at the 5' and 3' ends not matched/total number of nucleotides in the query sequence) 
so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the 
remaining 90 nucleotides were perfectly matched the final percent identity would be 90%. In 
another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query 
sequence. This time the deletions are internal deletions so that there are no nucleotides on the. 5' 
or 3' of the subject sequence which are not matched/aligned with the query.. In this case the 
percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 
5' and 3' of the subject sequence which are not matched/aligned with the query sequence are 
manually corrected for. No other manual corrections are to made for the purposes of the present 
invention^ 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated DNA molecules of 
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above-described host cells are known in the art. ' 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available 
from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, 
pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. Among preferred eukaryotic 
vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and 
pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be 
readily apparent to the skilled artisan. . , 

Among known bacterial promoters suitable for use in the present invention include the E. 
coli lad and lacZ promoters, the T3 arid T7 promoters, the gpt promoter, the lambda PR and PL- 
promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters 
of retroviral LTRs, such as -those of the Rous sarcoma virus (RS V), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. . 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis et al, Basic Methods In Molecular Biology (1986). 

Transcription of DNA encoding the polypeptides of the present invention by higher 
eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are 
cij-acting elements of DNA, usually about from 10 to 300 bp that act to increase transcriptional 
abtivity of a promoter in a given host cell-type. Examples of enhancers include the SV40 
enhancer, which is located on the late side of the replication origin at bp 100 to 270, the v 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, 
into the periplasmic space or into the extracellular environment, appropriate secretion signals may 
be incorporated into the expressed polypeptide. The signals may -be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may 
include not only secretion signals, but also additional heterologous functional regions. For 
instance, a region of additional amino acids, particularly charged amino acids, may be added to . 
the N-terminus of the polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties may be added to' 
the polypepdde to facilitate purification. Such regions may be removed prior to final preparation 
of the polypepdde. The addition of peptide moiedes to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, are familiar and routine 
techniques in the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 533 (Canadian 
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the present invention, host cells which are genetically engineered with the recombinant vectors, 
and the production of B. burgdorferi polypeptides or fragments thereof by recombinant 
techniques. 

Recombinant constructs may be introduced into host cells using well known techniques 
such as infection, transduction, transfection, transvection, electroporation and transformation. 
The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors 
may be replication competent or replication defective. In the latter case, viral propagation . 
generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium 
phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be 
packaged in vitro using an appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising czs-acting control regions to the polynucleotide of 
interest. Appropriate trans-acting factors may be supplied by the host, supplied by a 
complementing vector'or supplied by the vector itself upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for specific f 
expression, which may be inducible and/or cell typerspecific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to manipulate, such as 
temperature and nutrient additives. 

Expression vectors useful in- the present invention include chromosomal-, episomal- and 
virus-derived vectors, e.g., vectors derived from bacterial plasmids, bacteriophage, yeast 
episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and. vectors 
derived from combinations thereof, such as cosmids and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, such as the 
phage lambda PL promoter, the E. coli lac, trp and tac promoters, the SV40 early and late 
promoters and promoters of retroviral LTRs, to name, a few. Other suitable promoters will be 
known to the skilled artisan. The expression constructs will further contain sites for transcription 
initiation, termination and, in the transcribed region, a ribosome binding site for translation. The 
coding portion of the mature transcripts expressed byjthe constructs will preferably include a 
translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable marker. 
Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture 
and tetracycline or ampicillin resistance genes for culturing in E. coli and other bacteria. 
Representative examples of appropriate hosts include, but are not limited to, bacterial ceils, such 
as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the 
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counterpart 2045869) discloses fusion proteins comprising various portions of constant region of 
immunoglobin molecules together with another human protein or part thereof. In many cases, the 
Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus 
results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 

5 hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein 
has been expressed, detected and purified in the advantageous manner described. This is the case 
when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the 
fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, hIL5-receptor has been fused with Fc portions for the purpose of 

10 high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et aL, J. 
Molec, Recogn. 5:52-58 (1995) and Johanson, K. etal, J. Biol Chem. 270 (7<5J:9459-9471 

□ (1995). . 

ip The B. burgdorferi polypeptides can be recovered and purified from recombinant cell 

M cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 
(3fe extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
i|j hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 

□ chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") 
^ is employed for purification. Polypeptides of the present invention include naturally purified 

|l products, products of chemical synthetic procedures, and products produced by recombinant 

techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
?wj plant, insect and mammalian cells. - 

Polypeptides and Fragments 

The invention further provides isolated polypeptides having the amino acid sequences in 

25 Table 1, and peptides or polypeptides comprising portions of the above polypeptides. The terms 
- "peptide", and "oligopeptide" are considered synonymous (as is commonly recognized) and each 
term can be used interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polypeptide" is used herein for chains containing 
more than ten amino acid residues. All oligopeptide and polypeptide formulas or sequences 

30 herein are written from left to right and in the direction from amino terminus to carboxy terminus. 

As discussed in detail below, immunization using B. burgdorferi sensu stricto isolate B31 
decorin-binding protein elicits the production of antiserum which confers passive immunity - 
against Borrelia species and strains which express divergent forms of this protein.. Cassatt, D. et 
al, Protection of Borrelia burgdorferi Infection by Antibodies to Decorin-binding Protein, in 

35 Vaccines97, Cold Spring Harbor Press. (1997), pages- 191-195. Thus, some amino acid 
sequences of the B, burgdorferi polypeptides shown in Table 1 can be varied without 
significantly effecting the antigenicity of the polypeptides. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the polypeptide which 
determine antigenicity. In general, it is possible to replace residues which do not form' part of an 
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antigenic epitope without significantly effecting the antigenicity of a polypeptide. 
Variant and Mutant Polypeptides 

To improve or alter the characteristics of B. burgdorferi polypeptides of the present 
invention, protein engineering may be employed. Recombinant DNA technology known to those 
skilled in the art can be used to create novel mutant proteins or muteins including single or 
multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified 
polypeptides can show, e.g., enhanced activity or increased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. , 

N -Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. 
Chem., 268:2984-2988 (1993), reported modified KGF proteins that had heparin binding 
activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the 
present invention provides polypeptides having one or more residues deleted from the amino 
terminus of the amino acid sequence of the B: burgdorferi polypeptides shown in Table 1, and 
polynucleotides encoding such polypeptides. 

Similarly, many examples of biologically functional C-terminal deletion muteins are 
known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 
amino acid residues from the carboxy terminus of the protein See, e.g., Dobeli, et al. (1988) J. 
Biotechnology 7:199-216. Accordingly, the present invention provides polypeptides having one 
or more residues from the carboxy terminus of the amino acid sequence of the £. burgdorferi 
polypeptides shown in Table L The invention also provides polypeptides having one or more 
amino acids deleted from both the amino and the carboxyl termini as described below. 

. The present invention is further directed to polynucleotide encoding portions or fragments 
of the amino acid sequences described herein as well as to portions or fragments of the isolated 
amino acid sequences described herein. Fragments include portions of the amino acid sequences 
of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one 
of which representing a N-terminal position. The initiation codon of the polypeptides of the 
present inventions position 1. Every combination of a N-terminal and C-terminal position that a 
fragment at least 5 contiguous amino acid residues in length could occupy, on any given amino 
acid sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of residues. in a 
full length amino acid sequence minus 1. Therefore,' included in the invention are contiguous 
fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth 
in Table 1 w herein the contiguous fragment is any integer between 5 and the number of residues 
in a full length sequence minus 1 . 

Further, the invention includes polypeptides comprising fragments specified by size, in 
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amino acid residues, rather than by N-terminal and C-terminal positions. The invention includes 

any fragment size, in contiguous amino acid residues, selected from integers between 5 and the 
number of residues in a full length sequence minus 1. Preferred sizes of contiguous polypeptide 
fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino 
acid residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid 
residues, about 100 amino acid residues, about 200 amino acid residues, about 300 amino acid 
residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any integer between 5 
and the number of residues in a full length sequence minus 1 are included in the invention. The 
present invention also provides for the exclusion of any fragments specified by N-terminal and C- 
terminal positions or by size in amino acid residues as described above. Any number of 
fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as 
described above may be excluded. 

The above fragments need not be active since they would be useful, for example, in 
immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion 
of the protein, as vaccines, and as molecular weight markers. 

Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, it also will 
be recognized by one of ordinary skill in the art that some amino acid sequences of the B. 
burgdorferi polypeptide can be varied without significant effect of the structure or function of the 
protein. If such differences in sequence are contemplated, it should be remembered that there will 
be critical areas on the protein which determine activity. 

Thus, the invention further includes variations of the B. burgdorferi polypeptides which 
show substantial B. burgdorferi polypeptide activity or which include regions of B. burgdorferi 
protein such as the protein portions discussed below. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions selected according to general rules known in the art so 
as to have little effect on activity. For example, guidance concerning how to make phenotypically 
silent amino acid substitutions is provided. There are two main approaches for studying the 
tolerance of an amino acid sequence to change". See, Bowie, J. U. et al. (1990), Science 
247:1306-1310. The first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or screens to 
identify sequences that maintain functionality. . . 

These studies have revealed that proteins are surprisingly tolerant of amino acid 
• substitutions. The studies indicate which amino acid changes are likely to be permissive at a 
certain position of the protein. For example, most buried amino acid residues require nonpolar 
side chains, whereas few features of surface side chains are generally conserved. Other such 
phenotypically silent substitutions are described by Bowie et al. (supra) and the references cited 
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therein. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu and lie; interchange of the hydroxyl residues Ser 
and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues 
Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 
residues Phe, Tyr. 

Thus, the fragment; derivative, analog, or homolog of the polypeptide of Table 1, or that 
encoded by the plaimds listed in Table 1, may be: (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be one 
encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes 
a substituent group: or (iii) one in which the B. burgdorferi polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol): or (iy) one in which the additional amino acids are fused to the above form - 
of the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a 
sequence which is employed for purification of the above form of the polypeptide or a proprotein 
sequence. Such fragments, derivatives and analogs aire deemed to be within the scope of those 
skilled in the art from the teachings herein. 

Thus, the B. burgdorferi polypeptides of the present invention may include one or more 
amino acid substitutions, deletions, or additions, either from natural mutations or human 
manipulation. As indicated, changes are preferably of a minor nature, such, as conservative amino 
acid substitutions that do not significantly affect the folding or activity of the protein (see Table 3). 

Amino acids in the B. burgdorferi proteins of the present invention that are essential for 
function can be identified by methods known in the art, such as site-directed mutagenesis or 
alanine-scanning mutagenesis. See, e.g., Cunningham et al. (1989) Science. 244:1081-1085. 
The latter procedure introduces single alanine mutations. at every residue in the molecule. The 
resulting mutant molecules are then tested for biological activity using assays appropriate for 
measuring the function of the particular protein. 

Of special interest are substitutions of charged amino acids with other charged or neutral 
amino acids which may produce proteins with highly desirable improved characteristics, such as 
less aggregation. Aggregation may not only reduce activity but also be problematic when 
preparing pharmaceutical formulations, because aggregates can be immunogenic. See, e.g., 
Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838- 
845; Cleland, et al., (1993) Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an isolated form, and 
preferably are substantially purified. A recombinantly produced version of the B. burgdorferi 
polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) 
Gene 67:31-40. Polypeptides of the invention also can be purified from natural or recombinant 
sources using antibodies directed against the polypeptides of the invention in methods which are 
well known in the art of protein purification. 
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The invention further provides for isolated B. burgdorferi polypeptides comprising an 
amino acid sequence selected from the group consisting of : (a) the amino acid sequence of a full- 
length B. burgdorferi polypeptide having the complete amino acid sequence shown in Table 1; 
(b) the amino acid sequence of a full-length B. burgdorferi polypeptide having the complete amino 
acid sequence shown in Table 1 excepting the N-terminal methionine; (c) the complete amino acid 
sequence encoded by the plaimds listed in Table 1 ; and (d) the complete amino acid sequence 
excepting the N-terminal methionine encoded by the plaimds listed in Table 1. The polypeptides 
of the present invention also include polypeptides having an amino acid sequence at least 80% 
identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% 
or 99% identical to those described in (a), (b), (c), and (d) above. 

Further polypeptides of the present invention include polypeptides which have at least 
90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 
97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which comprises the amino 
acid sequence of a B. burgdorferi polypeptide having an amino acid sequence which contains at 
least one conservative amino acid substitution, but not more than 50 conservative amino acid 
substitutions, not more than 40 conservative amino acid substitutions, not more than 30 
conservative amino acid substitutions, and not more than 20 conservative amino acid 
substitutions. Also provided are polypeptides which comprise the amino acid sequence of a B. 
burgdorferi polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
conservative amino acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to 
a query amino acid sequence of the present invention, it is intended that the amino acid sequence 
of the subject polypeptide is identical to the query sequence except that the subject polypeptide 
sequence may include up to five amino acid alterations- per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. ^ 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 96%, 97%, 
98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino 
acid sequence encoded by the plaimds listed in Table" 1 can be determined conventionally using 
known computer programs. A preferred method for determining the best overall match between a , 
query sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brudag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the 
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query and subject sequences are both amino acid sequences. The result of said global sequence 
alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment 
are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty =20, Randomization Group 
Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 
Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is 
shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, the results, in percent identity, must be manually 
corrected. This is because the FASTDB program does not account for N- and C-terminal 
truncations of the subject sequence when calculating global percent identity. For subject 
sequences truncated at the N- and C-teimini, relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C-terminal 
of the subject sequence, which. are not matched/aligned with a corresponding subject residue, as a 
percent of the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then subtracted 
from the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score is what is 
used for the purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are considered for the 
purposes of manually adjusting the percent identity score. That is, only query amino acid 
residues outside the farthest N- and C-terminal residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query 
sequence to determine percent identity. The deletion occurs at the N-terminus of the subject 
sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at 
the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at 
the N- and C- termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the remaining 
90 residues were perfectly matched the final percent identity would be 90%. In another example, 
a 90 residue subject sequence is compared with a 100 residue query sequence. This time the 
deletions are internal ,so there are no residues at the N- or C-termini of the subject sequence which 
are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is 
not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of 
the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 
with the query sequence are manually corrected. No other manual corrections are to made for the 
purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they have their 
normal biological activity. This is because even where a particular polypeptide molecule does not 
have biological activity, one of skill in the art would still know how to use the polypeptide, for 
instance, as a vaccine or to generate antibodies. Other uses of the polypeptides of the present 
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invention that do not have B. burgdorferi activity include, inter alia, as epitope tags, in epitope 
mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel 
filtration columns using methods known to those of skill in the art. 

- As described below, the polypeptides of the present invention can also be used to raise 
polyclonal and monoclonal antibodies, which are useful in assays for detecting B. burgdorferi 
protein expression or as agonists and antagonists capable of enhancing or inhibiting B. 
burgdorferi protein function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" B. burgdorferi protein binding proteins which are also candidate agonists and 
antagonists according to the present invention. See, e.g., Fields et al. (1989) Nature - 
340:245-246. . . 

Epitope-Bedring Portions 

In another aspect, the invention- provides peptides and polypeptides comprising 
epi tope-bearing portions of the B. burgdorferi polypeptides of the present invention. These 
epitopes are immunogenic or antigenic epitopes of the polypeptides of the present invention. An ~ 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the 
whole protein or polypeptide is the "immunogen. These immunogenic epitopes are believed to be 
confined to a few loci on the molecule. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The 
number of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 
Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists 
amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The 
polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not 
considered non-antigenic. Thisls because they may still be antigenic in vivo but merely not 
recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues 
comprising preferred antigenic epitopes but not a complete list. Amino acid residues comprising 
other anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf analysis or 
by in vivo testing for an antigenic response using the methods described herein or those known in 
the art. ; ■ - - ; . - 

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that 
contain a region of a protein molecule to which an antibody can bind), it is well known in that art 
that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable 
of eliciting an antiserum that reacts with the partially mimicked protein. See, e.g., Sutcliffe, et 
al., (1983) Science 219:660-666. Peptides capable of eliciting protein-reactive sera are frequendy 
represented in the primary sequence of a protein, can be characterized by a set of simple chemical 
rules, and are confined neither to immunodominant regions of intact proteins (i.e. , immunogenic 
epitopes) hor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective at inducing antibodies that bind to the 



mimicked protein; longer, peptides, especially those containing proline residues, usually are 
effective. See, Sutcliffe, et al., supra, p. 661. For instance, 18 of 20 peptides designed 
according to these guidelines, containing 8-39 residues covering 75% of the sequence of the 
influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the 
HAl.protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the 
rabies glycoprotein induced antibodies that precipitated the respective proteins. 

- Antigenic epitope-bearirig peptides and polypeptides of the invention are. therefore useful 
to raise antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the 
invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors 
immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the 
native protein. See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic 
epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies . 
to different peptides may be used for tracking the fate of various regions of a protein precursor 
which undergoes post-translational processing. The peptides and anti-peptide antibodies may be 
used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in 
competition assays since it has been shown that even short peptides {e.g., about 9 amino acids) 
can bind and displace the larger peptides in immunoprecipitation assays. See, e.g., Wilson, et 
al., (1984) Cell 37:7 67 '-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography using methods 
known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention designed according 
to the above guidelines preferably contain a sequence of at least seven, more preferably at least 
nine and most preferably between about 10 to about 50 amino acids (i.e. any integer between 7 
and 50) contained within the amino acid sequence of a polypeptide of the invention. However, 
peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide 
of the invention, containing about 50 to about 100 amino acids, or any length up to and including 
the entire amino acid sequence of a polypeptide of the invention, also are considered 
epitope-bearing peptides or polypeptides of the invention and also are useful for inducing 
antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the 
epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents (i.e., the 
sequence includes relatively hydrophilic residues and highly hydrophobic sequences are 
preferably avoided); and sequences containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used to generate 
an Borrelia-specific immune response or antibodies include portions of the amino acid sequences 
identified in Table 1. More specifically, Table 4 discloses a list of non-limiting residues that are 
involved in the antigenicity of the epitope-bearing fragments of the present invention. Therefore, 
the present inventions provides for isoiatd and purified antigenic epitope-bearing. fragements of 
the polypeptides of the present invention comprising a peptide sequences of Table 4. The 
antigenic epitope-bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
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sequence of at least seven, more preferably at least nine and most preferably between about 10 to 
about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide of the present 
invention. That is, included in the present invention are antigenic polypeptides between the 
integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4. 
Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic 
polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length 
comprising one or more of the sequences of Table 4 are included. The antigenic epitope -bearing 
fragements may be specified by either the number of contiguous aminp acid residues or by 
specific N-terminal and C-terminal positions as described above for the polypeptide fragements of 
the present invention, wherein the initiation codon is residue 1. Any number of. the described 
antigenic epitope-bearing fragements of the present invention may also be excluded from the 
present invention in the same manner. 

The epitope-bearing peptides and polypeptides of the invention may be produced by any 
conventional means for making peptides or polypeptides including recombinant means using 
nucleic acid molecules" of the' invention. For instance, an epitope-bearing amino acid sequence, of 
the present invention may be fused to a larger polypeptide which acts as a carrier during 
recombinant production and purification, as well as during immunization to produce anti-peptide 
antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical 
synthesis. For instance, Houghten has described a simple method for. synthesis of large numbers 
of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid 
variants of a segment of the HA1 polypeptide which were prepared and characterized (by 
ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. Natl. Acad. Sci. 
USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS) M process 
is further described in U.S. Patent No. 4,631,21 1 to Houghten and coworkers (1986). In this 
procedure the individual resins For the solid-phase synthesis of various peptides are contained in 
separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps 
involved in solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. Sci. 
82:5131-5135 at 5134. " 

Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies 
according to methods well known in the art. See, e.g., Sutcliffe, et al., supra;; Wilson, et al., 
supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. Generally, animals may be 
immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of 
the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH)" or tetanus 
toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such 
as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be 
coupled to carrier using a more general linking agent such as glutaraldehyde. Animals such as 
rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 jig peptide or 
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carrier protein and Freund's adjuvant. Several booster injections may be needed, for instance, at 
intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELIS A assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by selection of 
5 anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of 
the selected antibodies according to methods well known in the art. 

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that 
elicit an antibody response when the whole protein is the immunogen, are identified according to 
methods known in the art. For instance,. Geysen, et al, supra, discloses a procedure for rapid 
10 concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an 
ELIS A. Interaction of synthesized peptides with antibodies is then easily detected without 
' removing them from the support. In this manner a peptide bearing an immunogenic epitope of a 

desired protein may be identified routinely by one of ordinary skill in the art. For instance, the 
(ill immunologically important epitope in the coat protein of foot-and-mouth disease virus was located 

by Geysen et al suprci with a resolution of seven amino acids by synthesis of an overlapping set 
?U of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. 
;f; ■ Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn 

at every position within the epitope were synthesized, and the particular amino acids conferring 
W specificity for the reaction with antibody were determined. Thus, peptide analogs of the 
fi|0 epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an 
immunogenic epitope of a desired protein. 

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method 
of detecting or determining the sequence of monomers (amino acids or other compounds) which 
25 is a topological equivalent of the epitope {i.e., a "mimotope") which is complementary to a 

particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Patent 
No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a 
sequence of monomers which is a topographical equivalent of a ligand which is complementary to 
the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 
30 to Houghten, R. A. et al (1996) discloses linear C r C 7 -alkyl peralkylated oligopeptides and sets 
and libraries of such peptides, as well as methods for using such oligopeptide sets and libraries 
for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor 
molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention 
also can be made routinely by these methods. The entire disclosure of each document cited in this 
35 section on "Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present invention and the 
epitope-bearing fragments thereof described above can be combined with parts of the constant 
domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. This has been shown, e.g., for 
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chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various 
domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. 
(EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a 
disulfide -linked dimeric structure due to the IgG part can also be more efficient in binding and 
neutralizing other molecules than a monomelic B. burgdorferi polypeptide or fragment thereof 
alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. Nucleic acids-encoding the 
above epitopes of B. burgdorferi polypeptides can also be recombined with a gene of interest as- 
an epitope tag to aid in detection and purification of the expressed polypeptide. 

Antibodies 

' B. burgdorferi protein-specific antibodies for use in the present invention can be raised 
against the intact B. burgdorferi protein or an antigenic polypeptide, fragment thereof, which may 
be presented together with a carrier protein, such as an albumin, to an animal system (such as , 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 
include intact molecules, single chain whole antibodies, and antibody fragments. Antibody 
fragments of the present invention include Fab and F(ab')2 and other fragments including single- 
chain Fvs (scFv) and disulfide-linked Fvs (sdFv). Also included in the present invention are 
chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the 
polypeptides of the present invention. The antibodies of the present invention may be prepared by 
any of a variety of methods. For example, cells expressing a polypeptide of the present invention 
or an antigenic fragment thereof can be administered to an animal in order to induce the production 
of sera containing polyclonal antibodies. For example, a preparation of B. burgdorferi 
-polypeptide or fragment thereof is prepared and purified to render it substantially free of natural 
contaminants. Such apreparatidn is then introduced into an animal in order to produce polyclonal 
antisera of greater specific activity . 

In a preferred method, the antibodies of the present invention are monoclonal antibodies or 
binding fragments thereof. Such monoclonal antibodies can be prepared using hybridoma 
technology". See t e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold , 
Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL 
ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and 
F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, B. burgdorferi 
polypeptide -binding fragments, chimeric, and humanized antibodies can be produced through the 
application of recombinant DNA technology or through synthetic chemistry using methods known 
in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide antigen of the 
present invention may be produced in a two-step procedure through the use of anti-idiotypic 
antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, 
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therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance 
with this method, B. burgdorferi polypeptide-specific antibodies are used to immunize an animal, 
preferably a mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody whose ability 
to bind to the B. burgdorferi polypeptide-specific antibody can be blocked by the B. burgdorferi 
polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the B. burgdorferi 
polypeptide-specific antibody and can be used to immunize an animal to induce formation of 
further B, burgdorferi polypeptide-specific antibodies. 

Antibodies and fragements thereof of the present invention may be described by the 
portion of a polypeptide of the present invention recognized or specifically bound by the antibody. 
Antibody binding fragements of a polypeptide of the present invention may be described or 
specified in the same manner as for polypeptide fragements discussed above., i.e, by N-terminal 
and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody, 
binding fragments, of a polypeptide of the present invention, specified by N-terminal and C- 
terminal positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the specifically 
bind a particuarlly discribed fragement of a polypeptide of the present invention and allows for the 
exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be described or 
specified in terms of their cross-reactivity. Antibodies and fragements that do not bind 
polypeptides of any other species of Borrelia other than B. burgdorferi are included in the present 
invention. Likewise, antibodies and fragements that bind only species of Borrelia, i.e. antibodies 
and fragements that do not bind bacteria from any genus other than Borrelia, are included in the 
present invention. 

Diagnostic Assays 

The present invention further relates to methods for assaying staphylococcal infection in 
an animal by detecting the expression of genes encoding staphylococcal polypeptides of the. 
present invention. The methods comprise analyzing tissue or body fluid from the animal for 
Borrelia-sp^cific antibodies, nucleic acids, or proteins. Analysis of nucleic acid specific to 
Borrelia is assayed by PCR or hybridization techniques using nucleic acid sequences of the 
present invention as either hybridization probes or primers. See, e.g., Sambrook et al. Molecular 
cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-8 10 (describing differentiation 
among spotted fever group Rickettsiae species by analysis of restriction fragment length 
polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. Microbiol. 32:589-595 
(detecting B: burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Borrelia has already been 
made, the present invention is useful for monitoring progression or regression of the disease state 
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whereby patients exhibiting enhanced Borrelia gene expression will experience a worse clinical 
outcome relative to patients expressing these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an animal, cell 
line, tissue culture, or other source which contains Borrelia polypeptide, mRNA, or DNA. 
Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial 
fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other biological source suspected 
of containing Borrelia polypeptides or nucleic acids. -Methods for obtaining biological samples 
such as tissue are well known in the art. , * . . 

The present invention is useful for detecting diseases related to Borrelia infections in 
animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, 
rabbits and humans. Particularly preferred are humans. 

Total RNA can- be isolated from a biological sample using any suitable technique such as 
the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski et 
al. (1987) Anal. Biochem: 162: 156- 159. mRNA encoding Borrelia polypeptides having sufficient 
homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between 
complementary sequences are then assayed using any appropriate method. These include 
Northern blot analysis, SI nuclease mapping; the polymerase chain reaction (PCR), reverse 
transcription in combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (199Q) Cell 
63:303-3 12. Briefly, total RNA is prepared from a biological sample as described above. For the 
Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl 
sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred 
onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker; the filter 
is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon 
sperm, SDS, and sodium phosphate buffer. . A B. burgdorferi polynucleotide sequence shown in 
Table 1 labeled according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system' (Amersham)) is used as probe. After hybridization overnight, the filter is washed and 
exposed to x-ray film. DNA for use as probe according to the present invention is described in 
the sections above and will preferably at least 15 nucleotides in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To 
prepare probe DNA for use in SI mapping, the sense strand of an above-described B. burgdorferi 
DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. 
The antisense DNA can then be digested using an appropriate restriction endonuclease to generate 
further DNA probes of a desired length. Such antisense probes are useful for visualizing 
protected bands corresponding to the target mRNA {i.e., mRNA encoding Borrelia polypeptides). 

Levels of mRNA encoding Borrelia polypeptides are assayed, for e.g., using the RT-PCR 
method described in Makino et al. (1990) Technique 2:295-301. By this method, the 
radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial 
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concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from 
a biological sample in a reaction mixture containing a RT primer and appropriate buffer. After 
incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, 
DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
of the RNA, the RT products are then subject to PCR using labeled primers. Alternatively, rather 
than labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR 
amplification can be performed in a DNA thermal cycler according to conventional techniques. 
After a suitable number of rounds to achieve amplification, the PCR reaction mixture is 
electrophoresed on a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate 
bands (corresponding to the mRNA encoding the Borrelia polypeptides of the present invention) 
are quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, 
reagent and gel concentrations, and labeling methods are well known in the art. Variations on the 
RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can detect the 
nucleic acid of the present invention can be found in PCR PRIMER: A LABORATORY ' L 
MANUAL (C.W. Dieffenbach et al. eds., Cold Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, may be used 
to detect polynucleotides of the present invention or Borrelia species including B. burgdorferi 
using bio chip technology. The present invention includes both high density chip arrays (>1000 
oligonucleotides per cm 2 ) and low density chip arrays (<1000 oligonucleotides per cm 2 ). Bio 
chips comprising arrays of polynucleotides of the present invention may be used to detect Borrelia 
species, including B. burgdorferi, in biological and environmental samples and to diagnose an 
animal, including humans, with an B. burgdorferi or other Borrelia infection. The bio chips of 
the present invention may comprise polynucleotide sequences of other pathogens including 
bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide 
sequences of the present invention, for use in rapid diffenertial pathogenic detection and 
diagnosis. The bio chips can also be used to monitor an B. burgdorferi or other Borrelia 
infections and to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response 
to drug therapy in the clinic and drug development in the laboratory. The bio chip technology 
comprising arrays of polynucleotides of the present invention may also be used to simultaneously 
monitor the expression of a multiplicity of genes, including those of the present invention. The 
polynucleotides used to comprise a selected array may be specified in the same manner as for the 
fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. 
Methods and particular uses of the polynucleotides of the present invention to detect Borrelia • 
species, "including B. burgdorferi, using bio chip technology include those known in the art and 
those of: U.S. Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 
5527681, 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/951 1995,. WO/9743447, WO/9535505, each incorporated herein in their entireties. 

Biosensors using the polynucleotides of the present invention may also be used to detect, 
diagnose, and monitor B. burgdorferi or other Borrelia species and infections. thereof. 
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Biosensors using the- polynucleotides of the present invention may also be used to detect particular 
polynucleotides of the present invention. Biosensors using the polynucleotides of the present 
invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, 
etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods 
and particular uses of the polynucleotides of the present invention to detect Borrelia species, 
including B. burgdorferi, using biosenors include those known in the art and those of: U.S. 
Patent Nos 5721102, 5658732, " 5631*170, and World Patent Nos. WO97/350U, WO/9720203, 
each incorporated herein in their entireties. 

Thus, the present invention includes both bio chips and biosensors comprising 
polynucleotides of the present invention and methods of their use. 

Assaying Borrelia polypeptide levels in a biological sample can occur using any art-known 
method, such as antibody-based techniques. For example, Borrelia polypeptide expression in 
tissues can be studied with classical immunbhistological methods. In these, the specific 
. recognition is provided by the prirriary antibody (polyclonal or monoclonal) but the secondary 
detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a 
result, an immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of Borrelia 
polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J. Cell. 
BipL : ' 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 105:3087-3096. In this technique, 
which is based on the use of cationic solid phases, quantitation of a Borrelia polypeptide can be 
accomplished using an isolated Borrelia polypeptide as a standard. This technique can also be 
applied to body fluids. 

Other antibody-based methods useful for detecting Borrelia polypeptide gene expression 
include immunoassays, such as the ELIS A and the radioimmunoassay (RIA). For example, a 
Borrelia polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and 
as an enzyme-labeled probe to detect and quantify a Borrelia polypeptide. The amount of a 
Borrelia polypeptide present in the sample can be calculated by reference to the amount present in 
a standard preparation using a linear regression computer algorithm. Such an ELIS A is described 
in Iacobelli et al. (1988) Breast Cancer Research and Treatment 1 1: 19-30. In another ELISA 
assay, two distinct specific monoclonal antibodies can be used to detect Borrelia polypeptides in a 
body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as 
the enzyme-labeled probe. . 

The above techniques may be conducted, essentially as a "one-step" or "two-step" assay. 
The "one-step" assay involves contacting the Borrelia polypeptide with immobilized antibody and, 
without washing, contacting the mixture with the labeled antibody. The "two-step" assay 
involves washing before contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to immobilize one component 
of the assay system on a support, thereby allowing other components of the system to be brought 
into contact with the component and readily removed from the sample. Variations of the above 
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and other immunological methods included in the present invention can also be found in Harlow 
et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 
2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, which 
catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase is 
particularly preferred as it has good stability and its substrate (glucose) is readily available. 
Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide 
formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels 
include radioisotopes, such as iodine ( i25 I, l21 I), carbon ( 14 C), sulphur ( 35 S), tritium ( 3 H), indium 
( 1I2 In), and technetium ( 99m Tc), and fluorescent labels, such as fluorescein and rhodamine, arid 
biotin. 

Further suitable labels for the Borrelia polypeptide-specific antibodies of the present 
invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, 
Borrelia nuclease, delta-5 -steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol 
phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, ni In, 125 I, I3I I, 32 P, 35 S, 14 C, 5I Cr, 
57 To, 58 Co, 59 Fe, 75 Se, l52 Eu, 90 Y, 67 Cu,. 2I7 Ci, 211 At, 2I2 Pb, 47 Sc, 109 Pd, etc: lll In is a preferred 
isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the 125 I or 
131 I-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more 
favorable gamma emission energy for imaging. See, e.g., Perkins et aL (1985) Eur. J. Nucl. 
Med. 10:296-301; Carasquillo et al. (1987) J. Nucl. Med. 28:281-287. For example, UI In 
coupled to monoclonal antibodies with l-(P-isothiocyanatobenzyl)-DPTA has shown little uptake 
in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include !57 Gd, 55 Mn, 162 Dy, 52 Tr, and 
- 56 Fe.. * • ' . ' 

Examples of suitable fluorescent labels include an 152 Eu label, a fluorescein label, an 
isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an 
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and 
cholera toxin. ■ - 

Examples of chemiluminescent labels include a luminal label, an isoluminal label, an 
aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, 
a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei 
such as Gd, Mn, and iron. 

' Typical techniques for binding the above-described labels to antibodies are provided by 
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Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 
81:1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate 
method, the dimaleimide method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, 
all of which methods are incorporated by reference herein. 

In a related aspect, the invention includes a diagnostic kit for use in screening serum 
containing antibodies specific against B. burgdorferi infection. Such a kit may include an 
isolated B. burgdorferi antigen comprising an epitope which is specifically immunoreactive with 
at least one anti-5. burgdorferi antibody. Such a kit also includes means for detecting the 
binding of said antibody to the antigen. In specific embodiments, the kit may include a 
recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or 
polypeptide antigen may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above -described kit includes a 
solid support to which said peptide or polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the 
antibody to the B. burgdorferi antigen can be detected by binding of the reporter labeled antibody 
to the anti-2?. burgdorferi polypeptide antibody. 

In a related aspect, the invention includes a method of detecting B. burgdorferi infection in 
a subject. This detection method includes reacting a body fluid, preferably serum, from the 
subject with an isolated B. burgdorferi antigen, and examining the antigen for the presence of 
bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to 
a solid support, and serum is reacted with the support. Subsequently, the support is reacted with 
a reporter-labeled anti-human antibody. . The support is then examined for the presence of 
reporter-labeled antibody. 

The solid surface reagent employed in the above assays and kits is prepared by known 
techniques for attaching protein~material to solid support material, such as polymeric beads, dip 
sticks, 96-well plates or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein , typically through a 
free amine group, to a chemically reactive group on the solid support, such as an activated 
carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). - 

The polypeptides and antibodies of the present invention, including fragments thereof, 
may be used to detect Borrelia species including B. burgdorferi using bio chip and biosensor 
technology. Bio chip and biosensors of the present invention may comprise the polypeptides of 
the present invention to detect antibodies, which specifically recognize Borrelia species,. including 
B. burgdorferi. Bio chip and biosensors of the present invention may also comprise antibodies 
which specifically recognize the polypeptides of the present invention to detect Borrelia species, 
including B. burgdorferi or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be used to detect 
Borrelia species, including B. burgdorferi, in biological and environmental samples and to 
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diagnose an animal, including humans, with an B. burgdorferi or other Borrelia infection. Thus, 
the present invention includes both bio chips and biosensors comprising polypeptides or 
antibodies of the present invention and methods of their use. 

The bio chips of the present invention may further comprise polypeptide sequences of 
other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition 
to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic 
detection and diagnosis. The bio chips of the present invention may further comprise antibodies 
or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal 
polypeptide sequences, in addition to the antibodies or fragements thereof of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an B. burgdorferi or other 
Borrelia infection and to monitor the genetic changes (amio aicid deletions, insertions, 
substitutions, etc.) in response to drug therapy in the clinic and drug development in the 
laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present- 
invention may also be used to simultaneously monitor the expression of a multiplicity of 
polypeptides, including those of the present invention. The polypeptides used to comprise a bio - 
chip or biosensor of the present invention may be specified in the same manner as for the 
fragements, i.e, by their N-terminal and C-terminal positions or length in contigious amino acid 
residue. Methods and particular uses of the polypeptides and antibodies of the present invention 
to detect Borrelia species, including B. burgdorferi, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. and World 
Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present 
invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and 
World Patent Nos. W09729366, W09612957, each incorporated herein in their entireties. 

Treatment: . ~ 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify those which 
enhance or block the biological activity of the B. burgdorferi polypeptides of the present 
invention. The present invention further provides where the compounds kill or slow the growth 
of B. burgdorferi. The ability of B. burgdorferi antagonists, including B. burgdorferi ligands, to 
prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled 
artisan. See, e.g., Straden et al. (1997) J Bacterid. i79(l):9-16. 

An agonist is a compound which increases the natural biological function or which 
functions in a manner similar to the polypeptides of the present invention, while antagonists 
decrease or eliminate such functions. Potential antagonists include small organic molecules, 
peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby 
inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross bridge 
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formation. Antibodies against B. burgdorferi may be employed to bind to and inhibit 5. 
burgdorferi activity to treat antibiotic resistance. Any of the above antagonists may be employed 
in a composition with a pharmaceutically acceptable carrier. 

Vaccines 

The present invention also provides vaccines comprising one or more polypeptides of the 
present. invention. Heterogeneity in the composition of a vaccine may be provided by combining 
B, burgdorferi polypeptides of the present invention. Multi -component vaccines of this type are . 
desirable because they are likely to be more effective in eliciting protective immune responses 
against multiple species and strains of the Borrelia genus than single polypeptide vaccines. Thus, 
as discussed in detail below, a multi-component vaccine of the present invention may contain one 
or more, preferably 2 to about 20, more preferably 2 to about 15, and most preferably 3 to about 
8, of the B. burgdorferi polypeptides shown in Table 1, or fragments thereof. 

Multi-component vaccines are known in the art to elicit antibody production to numerous 
immunogenic components. Decker, M. and Edwards, K., J. Infect. Dis. i74:S270-275 (1996). 
In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been 
demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic 
agents. Aristegui, J. et aL, Vaccine 15:1^9 (1997). 

The present invention thus also includes multi-component vaccines. These vaccines 
comprise more than one polypeptide, immunogen or antigen. An example of such a multi- 
component vaccine would be a vaccine comprising more than one of the B. burgdorferi 
polypeptides shown in Table 1. A second example is a vaccine comprising one or more, for . 
example 2 to 10, of the B. burgdorferi polypeptides shown in Table 1 and one or more, for 
example 2 to 10, additional polypeptides of either borrelial or non-borrelial origin. Thus, a multi- 
component vaccine which confers protective immunity to both a bonrelial infection and infection 
by another pathogenic agent is also within the scope of the invention. 

As indicated above, the vaccines of the present invention are expected to elicit a protective 
immune response against infections caused by species and strains of Borrelia other than B. 
burgdorferi sensu strict© isolate B3 1 (ATCC Accession No. 35210). Immunizations using 
decorin-binding protein and OspA derived from one strain of B. burgdorferi has been shown to 
elicit the production of antiserum which confers passive immunity against other strains of B. 
burgdorferi. Cassatt, D. et aL, Protection of Borrelia burgdorferi Infection by Antibodies to 
Decorin-binding Protein, in Vaccines97, Cold Spring Harbor Press (1997), pages 191-195. . 
Further, the inventors have found using an in vitro assay that antiserum produced in response to 
B. burgdorferi decorin-binding protein will kill several species of Borrelia. The amino acid 
sequences of decorin-binding protein expressed by different strains of B. burgdorferi are 
believed to diverge by as much as 25%. Thus, antisera elicited against decorin-binding proteins 
confers passive immunity against Borrelia expressing proteins having only 75% or less amino 
acid sequence similarity. 
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Further within the scope of the invention are whole cell and whole viral vaccines. Such 
vaccines may be produced reeombinantly and involve the expression of one or more of the 
B. burgdorferi polypeptides shown in Table 1. For example, the B. burgdorferi polypeptides of 
the present invention may be either secreted or localized intracellular, on the cell surface, or in the 
periplasmic space. Further, when a recombinant virus is used, the B. burgdorferi polypeptides 
of the present invention may, for example, be localized in the viral envelope, on the surface of the 
" capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing 
heterologous proteins are known in the art. See, e.g., Robinson, K. et al, Nature Biotech. 
75:653-657 (1997); Sirard, J. etaL, Infect. Immun. (55:2029-2033.(1997); Chabalgoity, J. etaL, 
Infect. Immun. 65:2402-2412 (1997). These cells may be administered live or may be killed 
prior to administration. Chabalgoity, J. et al., supra, for example, report the successful use in 
mice of a live attenuated Salmonella vaccine strain which, expresses a portion of a platyhelminth 
fatty acid-binding protein as a fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in the art by 
combining one or more B. burgdorferi polypeptides of the present invention, or fragments * 
thereof, with additional non-borrelial components (e.g., diphtheria toxin or tetanus toxin, and/or 
other compounds known to elicit an immune response). Such vaccines are useful for eliciting 
protective immune responses to both members of the Borrelia genus and non-borrelial pathogenic 
agents. . 

The vaccines of the present invention also include DNA vaccines. DNA vaccines are 
currently being developed for a number of infectious diseases. Boyer, J et al., Nat. Med. 3:526- 
532 (1997); reviewed in Spier, R^, Vaccine 14: 1285-1288 (1996). Such DNA vaccines contain a 
nucleotide sequence encoding one or more B. burgdorferi polypeptides of the present invention 
-oriented in a manner that allows for expression of the subject polypeptide. The direct 
administration of plasmid DNA "encoding OspA has been shown to elicit protective immunity in 
mice against borrelial challenge. Luke, C. et al, J. Infect. Dis. 775:91-97 (1997). 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim, J. et al., Nature 
Biotech. 75:641-646 (1997), for example, report the enhancement of immune responses produced 
by DNA immunizations when DNA sequences encoding molecules which stimulate the immune 
response are co-administered. In a similar fashion, the vaccines of the present invention may be 
co-administered with either nucleic acids encoding immune modulators or the immune modulators 
themselves. These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invendon may be used to confer resistance to borrelial 
infection by either passive or active immunization. When the vaccines of the present invention are 
used to confer resistance to borrelial infection through active immunization, a vaccine of the 
present invention is administered to an animal to.elicit a.protective immune response which either 
prevents or attenuates a borrelial infection. When the vaccines of the present invention are used to 
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confer resistance to borrelial infection through passive immunization, the vaccine is provided to a 
host animal {e.g., human, dog, or mouse), and the antisera elicited by this antisera is recovered 
and directly provided to a recipient suspected of having an infection caused by a member of the 
Borrelia genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules provides 
an additional method for treating borrelial infections when passive immunization is conducted. In 
this embodiment, antibodies, or fragments of antibodies, capable of recognizing the 
B. burgdorferi polypeptides disclosed herein, or fragments thereof, as well as other Borrelia 
proteins, are labeled with toxin molecules prior to their administration to the patient. When such 
toxin derivatized antibodies bind to Borrelia cells, toxin moieties will be localized to these cells 
and will cause their death. * 

The present invention thus concerns and provides a means for preventing or attenuating a 
borrelial infection resulting from organisms which have antigens. that are recognized and bound by 
antisera produced in response to the polypeptides of the present invention. As used herein, a 
vaccine is said to prevent or attenuate a disease if its administration to an animal results either in 
the total or partial attenuation {i.e., suppression) of a symptom or condition of the disease, or in 
the total or partial immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are 
provided in advance of any symptoms of borrelial infection. The prophylactic administration of 
the compound(s) serves to prevent or attenuate any subsequent infection. When provided 
therapeutically, the compound(s) is provided upon or after the detection of symptoms which 
indicate that an animal may be infected with a member of the Borrelia genus. The therapeutic 
administration of the compound(s) serves to attenuate any actual infection. Thus, the 
B. burgdorferi polypeptides, anil fragments thereof, of the present invention may be provided 
either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after 
the initiation of an actual infection. 

The polypeptides of the invention, whether encoding a portion of a native protein or a 
functional derivative thereof, may be administered in pure form or may be coupled to a. 
macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable 
proteins which may act as macromolecular carrier for enhancing the immunogenicity of the 
polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, 
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of 
the present invention to such macromolecular carriers are disclosed in Harlow et aL, Antibodies: 
A Laboratory Manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1988), the entire disclosure of which is incorporated by. reference herein. 

Axomposition is said to be "pharmacologically acceptable" if its administration can be 
tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an 
agent is said to be administered in a "therapeutically effective amount" if the amount administered 
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is physiologically significant. An agent is physiologically significant if its presence results in a 
detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that the 
5 composition of a pharmacologically acceptable compound varies with the animal to which it is 
administered: - For example, a vaccine intended for human use will generally not be co- 
administered with Freund's adjuvant. Further, the level of purity of the B. burgdorferi 
polypeptides of the present invention will normally be higher when administered to a human than 
when administered to a non-human animal. 
10 As would be understood by one of ordinary, skill in the art, when the. vaccine of the 

present invention is provided to an animal, it may be in a composition which may contain salts, 
buffers, adjuvants, or other substances which are desirable for improving the efficacy of the 

SJ composition. Adjuvants are substances that can be used to specifically augment a specific 
immune response. These substances generally perform two functions: (1) they protect the 

UJl5 antigen(s) from being rapidly catabolized after administration and (2) they nonspecifically 

Jjj stimulate immune responses. 

in] Normally, the adjuvant and the composition are mixed prior to presentation to the immune 

R| system, or presented separately, but into the same site of the animal being immunized. Adjuvants 
?*! can be loosely divided into several groups based upon their composition. These groups include 
vCfeo oil adjuvants (for example, Freund's complete and incomplete), mineral salts (for example, 
H A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ), silica, kaolin, and carbon), polynucleotides (for example, 
(3 P°ty IC and poly AU acids), and certain natural substances (for example, wax D from 
• Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or 

Bordetella pertussis, and members, of the genus Brucella, Other substances useful as adjuvants 
25 are the saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for 
use in the present invention include aluminum salts, such as A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , and 
A1NH 4 (S0 4 ). Examples of materials suitable for use in vaccine compositions, are provided in 
Remington's Pharmaceutical Sciences (Osol, A, Ed, Mack Publishing Co, Easton, PA, pp. 1324- * 
1341 (1980), which reference is incorporated herein by reference). 
30 The therapeutic compositions of the present invention can be administered parenterally by 

injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, 
or orally. The compositions may alternatively be administered intramuscularly, or intravenously. 
Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
35 polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance 
antigen absorption. Liquid dosage forms for oral administration may generally comprise a 
liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes 
include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents 
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commonly used in the art, such as purified water. Besides the inert diluents, such compositions 
can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 
flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization of mice against Bordetella pertussis 
infection using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide- 
co-glyc'olide) has been shown to stimulate protective immune responses. Shahin, R. et aL, 
Infect. Immun. (53:1195-1200 (1995). Similarly, orally administered encapsulated Salmonella 
typhimurium antigens have also been shown to elicit protective immunity in mice. Allaoui- 
Attarki, K. et aL, Infect. Immun. 65:853-857 (1997). Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving contacting the 
vaccine with mucous membranes (e.g., intranasally, intracolonicly, intraduodenally). 

Many different techniques exist for the timing of the immunizations when a multiple 
administration regimen is utilized. It is possible to use the compositions of the invention more 
than once to increase the levels and diversities of expression of the immunoglobulin repertoire 
expressed by the immunized animal. Typically, if multiple immunizations are given, they will be 
given one to two months apart. 

According to the present invention, an "effective amount" of a therapeutic composition is 
one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to 
provide an effective amount of the composition will vary depending upon such factors as the 
animal's or human's age, condition, sex, and extent of disease, if any, and other variables which . 
can be adjusted by one of ordinary skill' in the art. 

The antigenic preparations of the invention can be administered by either single or multiple 
dosages of an effective amount. Effective amounts of the compositions of the invention can vary 
from 0.01-1,000 |ig/ml per dose,' more preferably 0.1-500 |ig/ml per dose, and most preferably 
10-300 (ig/ml per dose. 

Having now generally described the invention, the same will be more readily understood 
through reference to the following example which is provided by way of illustration, and is not 
intended to be limiting of the present invention, unless specified. . 

Examples 

1. Preparation of PCR Primers and Amplification of DNA 
Various fragments of the Borrelia burgdorferi genome, such as those of Table 1, can be 
used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The 
PCR primers are preferably at least 15 bases, and more preferably at least 18 bases in length. 
When selecting a primer sequence, it is preferred that the primer pairs have approximately the 
same G/C ratio, so that melting temperatures are approximately the same. The PCR primers and 
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amplified DNA of this Example find use in the Examples that follow. 

2. Isolation of a Selected DNA Clone From B. burgdorferi 
Three approaches are used to isolate a B. burgdorferi clone comprising a polynucleotide of 
the present invention from any B. burgdorferi genomic DNA library. The B. burgdorferi strain 
B31PU has been deposited as a convienent source for obtaining a B. burgdorferi strain although a 
wide varity of strains B. burgdorferi strains can be used which are known in the art. 

B. burgdorferi genomic DNA is prepared using the following method. A 20ml overnight 
bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain Heart Infusion- broth 
or Super broth), pelleted, ished two times with TES (30mM Tris-pH 8.0, 25mM EDTA, 50mM 
NaCl), and resuspended in 5ml high salt TES (2.5M NaCl). Lysostaphin is added to final 
concentration of approx 50ug/ml and- the mixture is rotated slowly 1 hour at 37C to make 
protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and 
warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is 
then added to lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g 
in 5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). A CsCl 
gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M. CsCl and overlaying with 
2.85M CsCl. The gradient is carefully overlayed with the DNA-containing GuHCl solution. The 
gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume 
is increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 ug/ml) 
overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a - 
desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid B; burgdorferi 
genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the 
present invention. Particularly ,'a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. The 
oligonucleotide is labeled, for instance, with 32 P-y-ATP using T4 polynucleotide kinase and 
purified according to routine methods. {See, e.g., Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is ' 
transformed into a suitable host, as indicated above (such as XL-1 Blue (Stratagene)) using 
techniques known to those of skill in the art. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et 
al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 
1989). The transformants are plated on 1.5% agar plates (containing the appropriate selection 
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. These plates 
are screened using Nylon membranes according to routine methods for bacterial colony screening. 
See, e.g., Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN 
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MOLECULAR BIOLOGY (John Wiley and Sons, N. Y. 1989) or other techniques known to 
those of skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a 
polynucleotide of Table 1 are synthesized and used to amplify the desired DN A by PCR using a 
B. burgdorferi genomic DNA prep as a template. PCR is carried out under routine conditions, for 
instance, in 25 jal of reaction mixture with 0.5 ug of the above DNA template. A convenient 
reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) gelatin, 20 jlxM each of dATP, dCTP, dGTP, 
dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 

(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for .1 min) are 

performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is 
analyzed by agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified.. The PCR product is verified to be the selected sequence by subcloning and 
sequencing the DNA product. 

Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 
synthesized and used to generate a nucleotide sequence of desired length using PCR methods 
known in the art. ^ 

3(a). Expression and Purification Bprrelia polypeptides in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression of some of the 
polypeptide fragements of the present invention. (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains, 
a bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding site 
( M RBS")» six codons encoding histidine residues that allow affinity purification using nickel- 
nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and suitable single 
restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA 
fragment encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 X 
His tag") covalendy linked to the carboxyl terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a B. burgdorferi protein of the present 
invention is amplified from B. burgdorferi genomic DNA using PCR oligonucleotide primers 
which- anneal to the 5' and 3' sequences coding for the portions of the B, burgdorferi 
polynucleotide shown in Table 1 . Additional nucleotides containing restriction sites to facilitate 
cloning in the pQE60 vector are added to the 5' and 3' sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an appropriate 
restriction site followed by nucleotides of the amino terminal coding sequence of the desired B. 
burgdorferi polynucleotide sequence in Table 1. One of ordinary skill in the art would appreciate 
that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to 
amplify a DNA segment encoding any desired portion of the complete protein shorter or longer 
than the mature form. The 3' primer has a sequence containing an appropriate restriction site 




followed by nucleotides complementary to the 3' end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the .coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified B. burgdorferi DNA fragment and the vector pQE60 are digested with 
5 restriction enzymes which recognize the sites in the primers and the digested DNAs are then 

ligated together. The B. burgdorferi DNA is inserted into the restricted pQE60 vector in a mariner 
which places the B. burgdorferi protein coding region downstream from the EPTG-inducible 
promoter and in- frame with an initiating AUG and the six histidine codons. 

The ligation mixture is transformed into competent E. coli 'cells using standard procedures 
10 such as those described by Sambrook et al., supra.. E. coli strain M15/rep4, containing multiple 
copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing a B. burgdorferi polypeptide, is available 
W commercially (QIAGEN, Inc., supra), Transformants are identified by their ability to grow on 
].|j5 LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from 
W resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
)lj DNA sequencing. 

y Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 

W LB media supplemented with both ampicillin (100 jig/ml) and kanamycin (25 |ig/ml). The O/N 
(fp culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells 
ft! are grown to an optical density at 600 nm ("OD600") of between 0.4 and O.6.. Isopropyl-p-D- 
thiogalactopyranoside ( ,, IPTG M ) is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad' repressor. Cells 
subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

25 " The cells are then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. The cell 

debris is removed by centrifugation, and the supernatant containing the B. burgdorferi 
. polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column 

(QIAGEN, Inc., supra). Proteins" with a 6 x His tag bind to the Ni-NTA resin with high affinity 
are purified in a simple one-step procedure (for details see: The QIAexpressionist, 1995, - 

30 QIAGEN, Inc., supra). Briefly the supernatant is loaded onto the column in 6 M guanidine-HCl, 
pH S, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and finally the B. burgdorferi polypeptide is eluted 
with 6 M guanidine-HCl, pH 5. 

'The purified protein is then renatured by dialyzing it against phosphate-buffered saline 

35 (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the protein could be 
successfully refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 
mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over 
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a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 
mM immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCL The purified protein is stored at 4° C or frozen at -80° C. 

The polypeptide of the present invention are also prepared using a non-denaturingprotein 
purification method. For these polypeptides, the cell pellet from each liter of culture is 
resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer A = 50 mM Na-phosphate, 300 
mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 with 1 tablet of Complete EDTA- 
free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). 
Absorbance at 550 nm is approximately 10-20 O.D./ml. The suspension is then put through three 
freeze/thaw cycles from -70°C (using a ethanol-dry ice bath) up to room temperature. The cells 
are lysed via sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on 
ice. The sonicated sample is then centrifuged at 15,000 RPM for 30 minutes at 4°C. The 
supernatant is passed through a column containing 1.0 ml of CL-4B resin to pre-clear the sample 
of any proteins that may bind to agarose non-specifically, and the flow-through fraction is 
collected. 

The pre-cleared flow-through is applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag bind to the Ni-NTA 
resin with high affinity and can be purified in a simple one-step procedure. Briefly, the 
supernatant is loaded onto the column in Lysis Buffer A at 4°C, the column is first washed with 
10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the 
column is washed with 5 volumes of 40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) 
(Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 
500 mM Imidazole, pH of the final buffer should be 7.5). The protein is eluted off of the column 
with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to 
Buffer B. Three different concentrations are used: 3 volumes of 75 mM Imidazole, 3 volumes of 
150 mM Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein are analyzed using 8 %, 10 % or 14% SDS-PAGE depending on the protein size. The 
purified protein is then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it 
into an easily workable buffer. The purified protein is stored at 4°C or frozen at -80°. - 

The following alternative method may be used to purify B. burgdorferi expressed in E coli 
when it is present in the form of inclusion bodies. Unless otherwise specified, all of the 

following steps are conducted at 4- 1 0°C. 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 

cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 

Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 
amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 
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The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, 

Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl 
solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 x g for 15 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
7.4. . 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 
B. burgdorferi polypeptide-containing supernatant is' incubated at 4 C C overnight to allow further 
GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 
The refolded diluted protein solution is kept at 4 6 C without mixing for 12 hours prior to further 
purification steps. 

To clarify the refolded B. burgdorferi polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0. 16 urn membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 
1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 
effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 
acetate, pH 6.0. Both columns are. washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant A^ monitoring of the effluent. Fractions containing the B. 
burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 
stained 16% SDS-PAGE gel when 5 jig of purified protein is loaded. The purified protein is also 
tested for endotoxin/LPS contamination, and typically the LPS content is less than 0. 1 ng/ml 
according to LAL assays. 

3(b). Alternative. Expression and Purification Borrelia polypeptides in E. 
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coli 

Tthe vector pQElO is alternatively used to clone and express some of the polypeptides of 
the present invention for use in the soft tissue and systemic infection models discussed below. 
The difference being such that an inserted DNA fragment encoding a polypeptide expresses that 
polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the amino 
terminus of that polypeptide. The bacterial expression vector pQElO (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 91311) was used in this example . The components of the pQElO 
plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present 
invention expresses the polypeptide with the six His residues {i.e., a. "6 X His tag")) covalently •- 
linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 1 were, 
amplified using PCR oligonucleotide primers from genomic B. burgdorferi DNA. The PCR 
primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a 
polypeptide of the present invention. Additional nucleotides containing restriction sites to 
facilitate cloning in the pQElO vector were added to the 5' and 3 1 primer sequences, respectively. . 

For cloning a polypeptide of the present invention, the 5' and 3' primers were selected to 
amplify their respective nucleotide coding sequences. One of ordinary skill in the art would 
appreciate that the point in the protein coding sequence where the 5' and 3' primers begins may be 
varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present 
invention. The 5" primer was designed so the coding sequence of the 6 X His tag is aligned with 
the restriction site so. as to maintain its reading frame with that of B. burgdorferi polypeptide. The 
3' was designed to include an stop codon. The amplified DNA fragment was then cloned, and the 
protein expressed, as described above for the pQE60 plasmid. 

The DNA sequences of Table 1 encoding amino acid sequences may also be cloned and 
expressed as fusion proteins by a protocol similar to that described directly above, wherein the 
pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 5371 1) is preferentially used in 

place of p.QETO. ~ . 

The above methods are not limited to the polypeptide fragements actually produced. The 
above method, like the methods below, can be used to produce either full length polypeptides or 
desired fragements therof. 

3(c). Alternative Expression and Purification of Borrelia polypeptides in 

E. coli - , 

The bacterial expression vector pQE60 is used for bacterial expression in this example 
(QIAGEN, Inc., 9259 Eton Avenue,' Chatsworth, CA, 91311).. However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His codons is prevented 
and, therefore, the polypeptide is produced with no 6 X His tag. - 

The DNA sequence encoding the desired portion of the B. burgdorferi amino acid 
sequence is amplified from an B. burgdorferi genomic DNA prep the deposited DNA clones 
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using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences 
corresponding to the desired portion of the B. burgdorferi polypeptides. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' and 3' 
primer sequences. 

5. For cloning a B. burgdorferi polypeptides of the present invention, 5' and 3' primers are 

selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art 
would appreciate that the point in the protein coding sequence where the 5' and .3' primers begin 
may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the 
present invention. The 3' and 5* primers contain appropriate restriction sites followed by 
10 nucleotides complementary to the 5' and 3' ends of the coding sequence respectively. The 3' 
primer is additionally designed to include an in-frame stop codon. 

The amplified B. burgdorferi DNA fragments and the vector pQE60 are digested with 
restriction enzymes recognizing the sites in the primers and the digested DNAs are then ligated 
0 together. Insertion of the 5. burgdorferi DNA into the restricted pQE60 vector places the B. 

burgdorferi protein coding region including its associated stop codon downstream from the IPTG- 
|j inducible promoter and in-frame with an initiating AUG. The associated stop codon prevents 

translation of the six histidine codons downstream of the insertion point. 
|j The ligation mixture is transformed into competent E. coli cells using standard procedures 

such as those described by Sambrook et al. E. coli strain M15/rep4, containing multiple copies of 
the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance 
f|j ("Kanr"), is used in carrying out the illustrative example described herein. This strain, which is 
only one of many that are suitable for expressing B. burgdorferi polypeptide, is available 
commercially (QIAGEN, Inc., supra). Transformants are identified by their ability to grow on 
LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant 
25 colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid culture in 
LB media supplemented with both ampicillin (100 |Xg/ml) and kanamycin (25 jig/ml). The O/N 
culture is used to inoculate a large culture, at a dilution of approximately 1 :25 to 1 :250. The cells 

30 are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D- ' 
thiogaJactopyranoside ("EPTG")' is then added to a final concentration of 1 mM to induce 
transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequendy are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the B. burgdorferi polypeptide, the cells are then stirred for 3-4 hours at 4°C-in 

35 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the supernatant 
containing the B. burgdorferi polypeptide is dialyzed against 50 mM Na-acetate buffer pH 6, 
supplemented with 200 mM NaCl. Alternatively, the protein can be successfully refolded by 
dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HCl pH 7.4, containing protease 
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inhibitors. After renaturation the protein can be purified by ion exchange, hydrophobic interaction 
and size exclusion chromatography. Alternatively, an affinity chromatography step such as an 
antibody column can be used to obtain pure 5. burgdorferi polypeptide. The purified protein is 
stored at 4° C or frozen at -80° C. 

The following alternative method may be used to purify B. burgdorferi polypeptides 
expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C. 

1 Upon completion of the production phase of the E. coli fermentation, the cell culture is 
cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus 
Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 
amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris, 50 miVI EDTA, pH 7.4. The cells are dispersed to a 
homogeneous suspension using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer (Microfuidics, 
Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCl _ 
solution to a final concentration of 0.5 M NaCl ; followed by centrifugation at 7000 x g for 15 . 
min. The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 
7.4. 

The resulting washed inclusion bodies are solubilized with L5 M guanidine hydrochloride 
(GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the 
B. burgdorferi polypeptide-containing supernatant is incubated at 4°C overnight to allow further 
GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, the 
GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of 
buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. 
The refolded diluted protein solution is kept at 4° C without mixing for 12 hours prior to further 
purification steps.* 

To clarify the refolded 5. burgdorferi polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0. 16 (J.m membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample 
is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column 
is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 
1500 mM NaCl in the same buffer, in a stepwise manner. The absorbance at 280 mm of the 
effluent is continuously monitored/ Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the B. burgdorferi polypeptide are then pooled and. mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, 
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Perseptive Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium 
acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. 
The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M 
NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 M NaCl,. 50 mM sodium acetate, pH 6.5. Fractions 
are collected under constant A^ monitoring of the effluent. Fractions containing the B. 
burgdorferi polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant B. burgdorferi polypeptide exhibits greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands are observed from Commassie blue 
stained 16% SDS-PAGE gel when 5 jig of purified protein is loaded. The purified protein is also 
tested for endotoxiri/LPS contamination, and typically the LPS content is less than 0.1 ng/ml 
according to LAL assays. 

3(d), Cloning and Expression of B. burgdorferi in Other Bacteria 

B. burgdorferi polypeptides can also be produced in: B. burgdorferi using the methods of 
S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or L I. Moreno (1996) Protein Expr. Purif. 
8(3):332-340; Lactobacillus. using the methods of C. Rush et al., 1997 Appl. Microbiol. 
Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et al., U.S. Patent 
No. 4,952,508. 

4. Cloning and Expression in COS Cells 

A B. burgdorferi expression plasmid is made by cloning a portion of the DNA encoding a 
B. burgdorferi polypeptide into the expression vector pDN AI/Amp or pDNAm (which can be 
obtained fromjnvitrogen, Inc.): The expression vector pDNAI/amp contains: (1) an E. coli 
origin of replication effective for propagation in E. coli and other prokaryotic cells; (2) an 
ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) an SV40 origin 
of replication for propagation in eukaryotic cells; (4) a CMY promoter, a polylinker, an S V40 
intron; (5) several codons encoding. a hemagglutinin fragment (i.e., an "HA" tag to facilitate 
purification) followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and operably linked to 
the SV40 intron and the polyadenylation signal by means of restriction sites in the polylinker. 
The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described 
by Wilson et al. 1984 Cell 37:767. The fusion of the HA tag to the target protein allows easy 
detection and recovery of the recombinant protein with an antibody that recognizes the HA 
epitope. pDNATTT contains, in addition, the selectable neomycin marker. 

A DNA fragment encoding a B. burgdorferi polypeptide is cloned into the polylinker 
region of the vector so that recombinant protein expression is directed by the CMV promoter. The 
plasmid construction strategy is as follows. The DNA from a B. burgdorferi genomic DNA prep 
is amplified using primers that contain convenient restriction sites, much as described above for 
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construction of vectors for expression of B, burgdorferi in E. coli. The 5' primer contains a 
Kozak sequence, an AUG start codon, and nucleotides of the 5' coding region of the B. 
burgdorferi polypeptide. The 3" primer, contains nucleotides complementary to the 3' coding 
sequence of the B. burgdorferi DNA, a stop codon, and a convenient restriction site. 

The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 
appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an 
appropriate E. coli strain such as SURE™ (Stratagene Cloning Systems, La Jolla, CA 92037), 
and the transformed culture is plated on ampicillin media plates which then are incubated to allow 
growth ofampicmin resistant colonies. Plasmid DNA is isolated from resistant colonies and 
examined by restriction analysis or other means for the presence of the fragment encoding the B. 
burgdorferi polypeptide 

For expression of a recombinant B. burgdorferi polypeptide, COS cells are transfected 
p with an expression vector, as described above; using DEAE-dextran, as described, for instance, 
| by Sambrook et al. (supra). Cells are incubated under conditions for expression of B. 
l{5 burgdorferi by the vector. 

Expression of the B. burgdorferi-RA fusion protein is detected by radiolabeling and 
immunoprecipitation, using methods described in, for example Harlow et al., supra.. . To this 
end, two days after transfection, the cells are labeled by incubation in media containing 35 S- 
1 cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed 
|o with detergent-containing RIPA buffer: 150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% 
S4 DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. (supra ). Proteins are precipitated 
from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The 
precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
product of the expected size is seen in the cell lysate, which is not seen in negative controls. 
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5. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of B. burgdorferi polypeptide in this example. 
Plasmid pC4 is a derivative of the plasmid pS V2-dhfr (ATCC Accession No. 37 146). The 
plasmid contains the mouse DHFR gene under control of the SV40 early promoter. Chinese 

30 hamster ovary cells or other cells lacking dihydrofolate activity that are transfected with these 
plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life 
Technologies) supplemented with the chemotherapeutic agent methotrexate. The amplification of 
the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, e.g., 
Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. 

35 . Acta, 1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target enzyme, 
DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR 
gene, it is usually co-amplified and over-expressed. It.is known in the art that this approach may 
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be used to develop ceil lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the 
amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse 
Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. (1985) Mol. Cell. Biol. 
5:438-447; plus a fragment isolated from* the enhancer of the immediate early gene of human 
cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter 
are the following single restriction enzyme cleavage sites that allow the integration of the genes: 
Bam HI, Xba I, and Asp 718. Behind these cloning sites the plasmid contains the 3' intron and 
polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be 
used for the expression, e.g., the human- B-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HTV and HTLVL Clontech's Tet-Off and Tet- 
On gene expression systems and similar systems can be used to express the B. burgdorferi 
polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. 
USA 89:5547-5551. For the polyadenylation of the mRNA other signals, e.g., from the human 
growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest ' 
integrated into the chromosomes can also be selected upon co-transfection with a selectable 
marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable 
marker in the beginning, e.g., G418 plus methotrexate. 

The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated 
using calf intestinal phosphates by procedures known in the art. The vector is then isolated from 
a 1% agarose gel. The DNA sequence encoding the B. burgdorferi polypeptide is amplified using 
PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the desired portion of 
the gene. A 5' primer containing a restriction site, a Kozak sequence, an AUG start codon, and 
nucleotides of the 5' coding region of the B. burgdorferi polypeptide is synthesized and used. A 
3' primer, containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the B. burgdorferi polypeptides is synthesized and used. The amplified 
fragment is digested with the restriction endonucleases and then purified again on a 1% agarose 
gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 
E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC4 using, for instance; restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five 
jig of the expression plasmid pC4 is cotransfected with 0.5 jig of the plasmid pSVneo using a 
lipid-mediated transfection agent such as Lipofectin™ or LipofectAMINE.™ (LifeTechnologies 
"Gaithersburg, MD). -The plasmid pSV2-neo contains a dominant selectable marker, the.neo gene 
from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. 
The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the 
cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
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MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 
10-i4 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred to new 6-well 
plates containing even higher concentrations of methotrexate (1 (oM, 2 ixM, 5 [iM, 10 mM, 20 
mM). The same procedure is repeated until clones are obtained which grow at a concentration of 
100-200 jiM. Expression of the desired gene product is analyzed, for instance,, by SDS-PAGE 
and Western blot or by reversed phase HPLG analysis. 

6. Immunization and Detection of Immune Responses 



- 6(a). B. burgdorferi propagation 

B. burgdorferi sensu stricto isolate B3 1 is propagated in tightly-closed containers at 34°C 
in modified Barbour-Stoenner-Kelly (BSKH) medium (Barbour, A.G., Yale J. Biol Med, 
57:521-525 (1984)) overlaid with a 5%O 2 /5%CO 2 /90%N 2 gas mixture. Cell densities of these 
cultures are determined by darkfield microscopy at 400X. 
jjj Immunization of Mice and Challenge with B. burgdorferi. For active immunizations 

Uj BALB/cByJ mice (BALB, Jackson Laboratories) are injected intraperitoneally (i.p.) at week 0 
Q with 20 g of recombinant borrelial protein, or phosphate -buffered saline (PBS), emulsified with 
complete Freund's adjuvant (CFA), given a similar booster immunization in incomplete Freund's 
adjuvant (IF A) at week 4, and-challenged at week 6. For challenge B. burgdorferi are diluted in 
?| BSKH from exponentially-growing cultures and mice are injected subcutaneously (s.c.) at the 
□ base of the tail with 0.1 ml of these dilutions (typically 10 3 -10 4 borreliae; approximately 10-100 
^ times the median infectious dose). Borreliae used for challenge are passaged fewer than six times 

in vitro. To assess infection, mice are sacrificed at 14-17 days post-challenge, and specimens 
25 . derived from ear, bladder, and tibiotarsal joints are placed in BSKH plus 1.4% gelatin, 13 g/ml 
amphotericin B, 1.5 g/ml phosphomycin, and 15 g/ml rifampicin, and borrelia outgrowth at two 
or three weeks is quantified by darkfield microscopy. Batches of BSKH are- qualified for 
infection testing by confirming that they supported the growth of 1-5 cells of isolate B3 1 . In 
some instances seroconversion for protein P39 reactivity is also used to confirm infections (see 
30 below). Others have previously shown that mice elicited antibodies to P39 when inoculated with 
live borreliae by syringe or tick bite, but not with killed borreliae (Simpson, W.J., et al, 7. Clin. 
Microbiol 29:236-243 (1991)). 

6(b). Immunoassays 

35 - Several immunoassay formats are used to quantify levels of borrelia-specific antibodies 

(ELISA and immunoblot), and to evaluate the functional properties of these antibodies (growth 
inhibition assay). The ELISA and immunoblot assays are also used to detect and quantify 
antibodies elicited in response to borrelial infection that react with specific borrelial antigens. 
Where antibodies to certain borrelial antigens are elicited by infection this is taken as evidence that 
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the borrelial proteins in question are expressed in vivo. Absence of infection-derived antibodies 
(seroconversion) following borrelial challenge is evidence that infection is prevented or 
suppressed. The immunoblot assay is also used to ascertain whether antibodies raised against 
recombinant borrelial antigens recognize a protein of similar size in extracts of whole borreliae. 
Where the natural protein is of similar, or identical, size in the immunoblot assay to the 
recombinant version of the same protein, this is taken as evidence that the recombinant protein is 
the product of a full-length clone of the respective gene. 

Enzyme-Linked Immunosorbant Assay (EUSA). The ELISA is used to quantify levels of 
antibodies reactive with borrelial antigens elicited in response to immunization with these borrelial 
antigens. Wells of 96 well microtiter plates (Immunlon 4, Dynatech, Chantilly, Virginia, or 
equivalent) are coated with antigen by incubating 50 1 of 1 g/ml protein antigen solution in a 
suitable- buffer, typically 0.1 M sodium carbonate buffer at pH 9.6. After decanting unbound 
antigen, additional binding sites are blocked by incubating 100 1 of 3% nonfat milk in wash 
buffer (PBS,"0.2% Tween 20, pH 7.4). After washing, duplicate / serial two-fold dilutions of sera 
in PBS, Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are washed 
three times, and incubated with horseradish peroxidase-conjugated goat anti-mouse IgG. After 
three washes, bound antibodies are detected with H 2 Q 2 and 2,2'-azino : di-(3-ethylbenzthiazoline 
sulfonate) (Schwan, T.G., et al., Proc.Natl: Acad. ScL USA 92:2909-2913 (1985)) (ABTS®, 
Kirkegaard & Perry Labs., Gaithersburg, MD) and is quantified with a Molecular Devices, 
Corp. (Menlo Park, California) Vmax™ plate reader. IgG levels twice the background level in 
serum from naive mice are assigned the minimum titer of 1: 100. 

6(c). In Vitro Growth Inhibition Assay 

Unlike other bacteria, borreliae can be killed by the binding of specific antibodies to their 
surface antigens. The mechanism for this in vitro killing or growth-inhibitory effect is riot 
known, but can occur in the absence of serum complement, or other immune effector functions. 
Antibodies elicited in animals receiving immunizations with specific borrelial antigens that result 
in protection from borrelial challenge usually will directly kill borreliae in vitro. Thus, the in vitro 
growth inhibition assay also has a high predictive value for the protective potency of the borrelial 
antibodies, although exceptions, such as antibodies. against OspC which are weak at in vitro 
growth inhibition, have been observed.. Also, this assay can be used to evaluate the serologic 
conservation of epitope binding protective antibodies. A micro well antibody titration assay 
(Sadziene, A., et al f J. Infect. Dis. 167: 165-172 (1993)) is used to evaluate the growth inhibition 
(GI) properties of antisera against recombinant borrelial antigens against the homologous B3 1 
isolate, and against various strains of bonrelia. Briefly, 10 5 borrelia in 100 1 BSKH are added to 
serial two-fold dilutions of sera in 100 1 BSKII in 96-well plates, and the plates are covered and 
incubated at 34°C in a 5%O 2 /5%CO 2 /90%N 2 gas mixture for 72 h prior to quantification of 
borrelia growth by darkfield microscopy/ 
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6(d). Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis 
(SDS-PAGE) and Immunoblotting 

. Using a single well format, total borrelial protein extracts, recombinant borrelial antigen, 
or recombinant P39 samples (2 g of purified protein, or more for total borrelial extracts) are - 
5 boiled in SDS/2-ME sample buffer before electrophoresis through 3% acrylamide stacking gels, 
and resolving gels of higher acrylamide concentration, typically 10-15% acrylamide monomer. - 
Gels are electro-blotted to nitrocellulose membranes and lanes are probed with dilutions of 
antibody to be tested for reactivity with specific borrelial antigens, followed by the appropriate 
secondary antibody-enzyme (horseradish peroxidase) conjugate. When it is desirable to confirm 
10 that the protein had transferred following electro-blotting, membranes are stained with Ponceau S. 
Immunoblot signals from bound antibodies are detected on x-ray film as chemiluminescence using 
ECL™ reagents (Amersham Corp., Arlington Heights, Illinois). 

m ^ 

j'P 6(e). Detection of Borrelia mRNA expression 

Northern blot analysis is carried out using methods described by, among others, 
E) ' Sambrook ei al, supra, to detect the expression of the B. burgdorferi nucleotide sequences of the 
?U - present invention in animal tissues. A cDN A probe containing an entire nucleotide sequence 
W shown in Table 1 is labeled with 32 P using the retfiprime™ DNA labeling system (Amersham Life 

Science), according to manufacturer's instructions. After labeling, the probe is purified using a 
(2feo CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), according to manufacturer's 
p protocol number PT 1200-1 . The purified labeled probe is then used to detect the expression of 
Borrelia mRNA in an animal tissue sample. 

Animal tissues, such as blood or spinal fluid, are examined with the labeled probe using 
ExpressHyb™ hybridization solution (Clontech) according to manufacturer's protocol number 
25 PT1 190-1. Following hybridization and washing, the blots are mounted and exposed to film at - 
70 C overnight, and films developed according to standard procedures. 

The disclosure of all publications (including patents, patent applications, journal articles, 
laboratory manuals, books K or other documents) cited herein are hereby incorporated by reference 
in their entireties. 

30 The present invention is not to be limited in scope by the specific embodiments described 

herein, which are intended as single illustrations of individual aspects of the invention. 
Functionally equivalent methods and components are within the scope of the invention, in 
addition to those shown and described herein and will become apparant to those skilled in the art 
from the foregoing description and accompanying drawings. Such modifications are intended to 
35 * fall within the scope of the appended claims. 

' Provisional Application Serial No. 60/057,483 filed 3 September 1997 is incorporated by 
. reference herein in its entirety 
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TABLE 1 . Nucleotide and Amino Acid Sequences 



flOl.aa 

MSKIFLLFNAGFFFLKIIYVFSYPEIKNFSRQDPVFSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKING 
WQSGIATINLLDSPAVSVGQEILIPSKKGVFVFDSKDYRFNNLL^ 

DFGLFSGTELLFFLNANFIFPLKKFIVSSDFGFRNDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 
tlOl.aa ... 

SYPEIKNFSRQDPVFSDLKIKVLKYNKKQHIPLFFYSYKVKKGDTFFKIANKINGWQSGIATINLLDSPAVSVGQE 
ILIPSKKGVFVFDSKDYRFNNLLLATRDLAKAEKVKIKRNDRVYEFYFFDFVKNPDFGLFSGTELLFFLNANFIFP 
LKKFIVSSDFGFRNDPFTGNKSFHTGIDLAAPMNAEVYLLLLE 



flOl.nt 

. ATGAGTAAAATTTTTTTATTATTTAATGCAGGTTTCTTTTTTTTAAAAATAATTTATGTTTTTTCTTATCCAGAAA 
TAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCT 

TATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAATAAAATAAATGGA 
TGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAGATTCTTATTCCCA 
GTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAACAAGGGATCTTGC 
TAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTTTGTTAAGAATCCA 
' GATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCTTTAAAAAAATTTA 
TTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTTTCACTGGCAACAAAAGTTTCCATACAGGAATAGATCTTGC 
AGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG - - . 



tlOl.nt 

TCTTATCCAGAAATAAAAAATTTCTCAAGGCAAGATCCTGTTTTTTCTGATCTTAAAATTAAAGTTTTAAAATATA' 

ACAAAAAAGAACATATTCCTCTGTTTTTTTACTCATATAAAGTTAAAAAAGGGGATACTTTTTTTAAAATTGCCAA 

TAAAATAAATGGATGGCAGTCCGGCATTGCTACTATTAATTTATTAGATTCTCCTGCTGTGAGTGTTGGGCAAGAG 

ATTCTTATTCCCAGTAAAAAAGGAGTTTTTGTTTTTGATAGTAAAGATTATAGATTTAATAATTTGCTTTTAGCAA 

CAAGGGATCTTGCTAAAGCTGAAAAGGTAAAAATTAAAAGGAACGACAGAGTTTATGAATTTTATTTTTTTGATTT 

TGTTAAGAATCCAGATTTTGGACTTTTTTCAGGCACAGAATTGCTTTTTTTCTTAAATGCCAATTTTATTTTTCCT 

TTAAAAAAATTTATTGTTAGTTCTGATTTTGGATTTAGAAATGACCCTTTCACTGGCAACAAAAGTTTCCATACAG 

GAATAGATCTTGCAGCTCCAATGAATGCTGAAGTGTATCTTCTTCTTCTGGAATAG 

fll -aa . - 

VKKYIKTIFLISMVYFYCCTTIKimDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDVVKINWQNTSLN 

NDKIVLKKEDLTINNETGYKNKYREFFIGPKTSFKFKVYPL^ 

TINVLITRTTKINITNK 

til .aa - ' • "■ - 

CCTTIKINHDYETDFKVLESPSKYINIDVIKATNEYIYIQITNNSLDVVKI^^^QNTSLNNDKIVLKKEDLTINNET 
GYKNKYREFFIGPKTSFKFKVYPLKIHSKNKNSNNLSSTIKYPSXFKLNITKVGIEAKKTI3WL 

fll.nt - 

GTGGAAAAATTTCTTTTATTCCAGGAAATGAAAATATTGCAGATCTTGGTTTTCATAAAACTAAGTAGAAATATTG 
TCAAAAAAJTACATAAAAACAATATTTCTGATTTC 

TGATT ATGAAACTGATTTTAAAGTTC T AGAATC TC C CTC T AAATAC ATC AATATAGATGTAATTAAAGCT AC AAAT 
GAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAATAAATTGGCAAAACACTAGTCTTAACA 
■ACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACAGGGTATAAAAATAAATACA.GAGAGTT 
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TABLE 1. Nucleotide and Amino Acid Sequences 



TTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAATTCATTCTAAAAACAAAAATAGCAAT 
AACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAACAAAAGTAGGAATTGAAGCAAAAAAAA 
CAATAAATGTTTTAATAACAAGAACTACAAAAATTAATATTACTAATAAATGA 

til .nt 

TGTTGTACGACAATAAAAATAAACCATGATTATGAAACTGATTTTAAAGTTCTAGAATCTCCCTCTAAATACATCA 

ATATAGATGTAATTAAAGCTACAAATGAATATATTTATATTCAAATTACAAACAATAGCTTAGACGTAGTAAAAAT 

AAATTGGCAAAACACTAGTCTTAACAACGATAAGATCGTCTTAAAAAAAGAAGATCTTACAATAAACAATGAAACA 

GGGTATAAAAATAAATACAGAGAGTTTTTTATTGGTCCTAAAACTTCATTTAAATTTAAAGTATATCCACTAAAAA 

TTCATTCTAAAAACAAAAATAGCAATAACTTAAGCTCAACTATTAAATATCCGTCTATTTTTAAGCTCAACATAAC 

AAAAGTAGGAATTGAAGCAAAAAAAACAATAAATGTTTTAATAACAAGAACTACAAA 

TGA ■ 

fl2.aa . 

MREFLYRNVFKKSFIVFLIFLTFSNAIFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDV 

SKIRELLLKKFGIDPELFIKGKGLAGSGRYKIIIETADNLENFTYGLTKDESIIFEGRVNILVEDIKENKKHNIKG 

DRIVLNKNSKKLYAIGWEYILDMDTNEKLYFYGNEFLVDFDSQNFLLKj>JGILQKKMQKNQIDHILSFGGKVLKKI 

DNDVTILEQAFATTSKIPEPYYSIKASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGLNPRK 

GFSVFNTVYLFGNKSSSEDSSFLDFDFNSVYNSGKKPYIRNGYLTYFFAENLAPSWKDWKLIFDIYANLG 

IDFNLGNTLGHFKTLEGNFGLGFTRNVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLL- 

SDALFSVVLEHYSDPYVNIDFRDRIESATFFSLLNLDKDSVKEQTSISTFDWNLSSFYKRTFNDGSILDYKLNNLG 

LSFKLSGYENLYVKSPLEKPKDVNDPTRKWFYLERIYAPYIDLNFQKDLYNNQWTFPADTKEMIMRPEIKNLEDKD 

NDKKSVKEKNTKKTTELTKDL^^ 

IKNKTDIKIHADFYNRLITFENLLYLNTIEYSPLNKDFKVEDKDKKSEHSIINQINLNLLPFIRYPLFSRSTLKFE 
NKATLYSFNKKYDSDVKSLVNKNSSIFLSDPETFYQSLTASLIYDYDYFTTELSGELKNSFEDIKASSELKLSLDF 
PYLLQEAGIGIKYYKKFKEDAMKNSGISAVQSPLEPQKPSSPYKNLEMSPALYYKIEPRYLDYFKFSFLVAYDPLI 

NRVSELSFKLNVFDFQFLFAMKDDFEYNYDPL^^ 

GWKINLQKFTDNELRSALTLKFKYTEFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRK 
DSLFKIKKFSSGFKFNFYDWKFVGEYNLEPDLLRGSDGIYSPIWRNNFTIYISWNFFAPIKASFENNKDTNYEFII 

NRKTKK 
tl2.aa 

IFAQTIDDENSKKRDKLTLSQKSYLRELELSTDEDLKKWALKEGLKETDVSKIRELLLKKFGIDPELFIKGKGLAG 
SGRYKIIIETADNLENFT-YGLTKDESIIFEGRVNILVEDIKENKKHNIKGDRIVLNKNSKKLYAIGNVEYILDMDT 
NE1KLYFYGNEFLVDFDSQNFLLKNGILQKKMQKNQIDHILSFGGKVLKKIDNDVTILEQAFATTSKIPEPYYSIK 
ASKIWALPSGDFGFLNAIFYMGRVPVFYIPFFFRPGDSLFFNPSLGLNPRKGFSVFNTVYLFGNKSSSEDSSFLDF 
DFNSVYNSGKKPYIRNGYLTYFFAENLAPSVNKDYVKLIFDIYANLGFYSGIDFNLGNTLGHFKTLEGNFGLGFTR 
NVYSYDGGYYPFDNRTLKQSLFSFSNLNKGDVFGFEVPFRYLFKFKTEFLLSDALFSWLEHYSDPYVNIDFRDRI 
ESATFFSLLNLDKDSVKEQTSISTFDWNLSSFYKRTFNDGSILDYKLNNLGLSFKLSGYENLYVKSPLEKPKDVND 

PTRKWFYLERI YAP YIDLNFQKDL YNNQWTF PADTKEMIMRP E I KNLEDKDNDKKSVKEKNTKKTTELTKDL YI ? P 
EPITLKNIDQSDSFFIRFGINPYLRNNVFFDNYGITSPKDFNYEIKNYLFDIKNKTDIKIHADFYNRLITFENLLY 
LNTIEYSPLNKDFKVEDKDKKSEHSIINQINLNLLPFIP.YPLFSRSTLKFENKATLYSFNRKYDSDVKSLVNKNSS 
IFLSDPETFYQSLTASLIYDYDYFTTELSGELKNSFEDIKASSELKLSLDFPYLLQEAGIGIKYYKKFKEDAMKNS 
GISAVQSPLEPQKPSSPYKNLEMSPALYYKIEPRYLDYFKFSFLVAYDPLINRVSELSFKLNVFDFQFLFAMKDDF 
EYNYDPLKGDFSKIGTTTKLVPYSLDSSYKKELYVLTFFDNKLSFTLGVDVGWKINLQKFTDNELRSALTLKFKY^ 
EFLEIYFSTLSINTKTFKYFKGYMDQIGLEPVNFFVDLSKSFNFFNSQDRKDSLFKIKKFSSGFKFNFYDWKFVGE 
YNLEPDLLRGSDGI YS P IWRNNFT I Y I SWNF F AP I KAS FENNKDTNYEF I INRKTKK . . 

fl2.nt . ■ ' 

ATGCGAGAATTCCTATACAGGAATGTTTTC 
CAATTTTTGCCCAGACTATAGATGATGAAAATTCT^^ 

AAGAG AAC TTGAGC TTTC AACCG ATG AGGATTTAAAAAAATGGGCC TT AAAAGAGGGTTT AAAAGAAAC AG ATGTT 
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TCAAAAATACGAGAATTGCTTTTAAAAAAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAAAGGGACTTGCCG 
GATCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGA 
AAGTATTA.TTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGCACAATATTAAAGGC 
GACAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCTTGATATGGATA 
CCAATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTTTTTATTAAAAAATGG 
TATTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATTCTTTCGTTTGGAGGAAAGGTTTTAAAAAAGATA 
GACAATGATGTTACCATTTTGGAACAAGCTTTTGCAACAACTAGTAAAATTCCAGAGCCTTACTATTCAATCAAGG 
CTTCTAAAATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCATATTTTACATGGGAAGAGTTCCAGT 
ATTTTATATTCCTTTTTTTTTCAGAGCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAAAA 
GGTTTTTCTGTTTTTAATACCGTTTATCTTTTTGGTAATAAATCTTCAAGTGAAGATTCTTCTTTTTTGGATTTTG 
ATTTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAA 
TTTAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGA 
ATTGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATTTTGGATTGGGTTTTACCAGGA 
ATGTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAA 
TCTTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTA 
AGTGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAG 
AAAGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGA 
TTGGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGT 
TTAAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATC 
CTACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAA 
TAACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAJ^AATCTAGAAGATAAAGAT 
AATGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAG 

• AACCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAA 
TAATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTGAT 
ATAAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATC 
TTAATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTAT 
TATTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCGTTTATTTTCTAGAAGTACTTTAAAGTTTGAA 
AATAAGGCTACTTTATATTCATTTAATAAAAAATATGATTCTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTA 
TTTTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTAC 
TGAGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTTTCTTTAGATTTT 
CCTTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTG 
GAATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGC 
TTTGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATA 
AATAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTG 
AATATAATTATGATCCTTTAAAAGGAGATTTTTCCAAGATTGGTACTACAACCAAACTTGTTCCATATTCTTTAGA 
TTCTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTT 
GGTTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAG 
AATTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCA 
AATTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAA.TTCTCAAGACAGAAAA 
GATTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAAT 
ATAATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATTTA 
TATTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTATT 

AATAGAAAAACAAAAAAATAA 

• tl2 .nt • 

ATTTTTGCCCAGACTATAGATGATGAAAATTCTAAAAAAAGGGATAAGCTAACTTTAAGTCAAAAATCTTATTTAA 
GAGAACTTGAGCTTTCAACCGATGAGGATTTAAAAAAATGGGCCTTAAAAGAGGGTTTAAAAGAAACAGATGTTTC 
AAAJVATACGAGAATTGCTTTTAJW^AAGTTTGGAATAGATCCTGAGCTTTTTATCAAAGGAA^ 

TCTGGTAGATATAAAATAATCATTGAAACTGCAGATAATCTTGAAAATTTCACTTATGGACTTACTAAAGATGAAA 
GTATTATTTTTGAAGGAAGAGTTAATATCTTGGTTGAAGATATTAAAGAAAATAAAAAGGACAATATTAAAGGCGA , 
CAGAATAGTCCTTAATAAGAACTCTAAAAAACTTTATGCTATTGGAAATGTTGAATATATTCTTGATATGGATACC 
AATGAAAAGCTTTATTTTTATGGCAATGAATTTCTTGTCGATTTTGATTCTCAAAATTTTTTATTAAAAAATGGTA 

TTCTTCAAAAAAAAATGCAAAAAAATCAAATAGATCATATT^ 

C AATGATGTT A.C C ATTTTGG AAC AAGC TTTTGC AAC AACT AGTAAAATTC C AG AGCC TTAC TATTC AATC AAGGCT 
TCTAJ^AATATGGGCATTGCCCTCGGGAGATTTTGGGTTTTTAAATGCCATATTTTACATGGGAAGAGTTCCAGTAT' 
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-TTTATATTCCTTTTTTTTTCAGACCGGGAGATAGTTTGTTTTTTAATCCATCTTTAGGTCTAAATCCACGAAAAGG 
TTTTTC TGTTTTT AAT ACCGTTTATC TTTTTGGT AATAAATC TTC AAGTGAAGATTC TTCTTTTTTGGATTTTGAT 
TTCAATTCTGTTTATAATTCGGGTAAAAAACCTTATATAAGAAATGGATATTTAACTTATTTTTTTGCAGAAAATT 
TAGCACCCAGTGTTAATAAAGATTATGTTAAGCTTATTTTTGACATTTATGCTAATCTGGGATTTTATTCTGGAAT 
TGATTTTAATTTGGGCAATACTTTGGGGCATTTTAAAACTTTGGAAGGAAATTTTGGATTGGGTTTTACCAGGAAT 
GTTTATAGTTACGATGGAGGATATTATCCTTTTGATAATAGGACTTTAAAACAATCTCTTTTTAGTTTTTCCAATC 
TTAACAAAGGAGATGTATTTGGGTTTGAAGTTCCTTTTAGATATTTATTTAAATTTAAAACAGAATTTCTTTTAAG 
TGATGCACTTTTCTCGGTTGTTTTAGAGCACTATTCTGACCCGTATGTTAATATTGATTTTAGAGATAGGATAGAA 
AGTGCTACATTTTTTTCTCTTTTAAATTTAGATAAAGATTCGGTTAAAGAGCAAACTAGCATTAGCACTTTTGATT 
GGAATTTATCTTCTTTTTATAAGCGAACATTTAATGACGGTTCGATTTTAGATTATAAATTAAATAATTTAGGTTT 
AAGTTTTAAATTGTCGGGCTATGAAAATCTTTATGTTAAATCTCCTTTAGAGAAACCAAAAGATGTTAATGATCCT 
ACAAGAAAATGGTTTTATTTGGAGAGAATTTATGCTCCATATATTGATTTGAATTTTCAAAAAGATCTTTACAATA 
ACCAATGGACATTTCCAGCTGATACTAAAGAAATGATAATGCGCCCAGAAATTAAAAATCTAGAAGATAAAGATAA 
TGATAAAAAGAGTGTGAAGGAGAAAAATACTAAAAAAACAACAGAATTAACCAAAGATTTATATATTCCTCCAGA^ 
CCAATTACTTTAAAAAATATTGATCAATCCGATTCTTTTTTTATTAGGTTTGGCATTAATCCTTATTTAAGAAATA 
ATGTTTTTTTTGATAATTATGGCATAACAAGTCCAAAGGACTTTAATTATGAAATAAAAAATTATTTATTTGATAT 
AAAAAATAAAACGGATATAAAAATTCATGCTGATTTTTACAATCGTTTAATTACTTTTGAAAATTTATTATATCTT 
AATACTATTGAGTATAGTCCTTTAAATAAAGATTTTAAAGTTGAAGATAAAGATAAAAAAAGTGAGCACTCTATTA 
TTAACCAAATAAATTTAAACTTGCTTCCTTTTATTAGATATCCTTTATTTTCTAGAAGTACTTTAAAGTTTGAAAA 
TAAGGCTACTTTATATTCATTTAATAAAAAATATGATTGTGATGTAAAATCTTTGGTTAATAAGAATAGTAGTATT 
TTTTTATCTGATCCGGAAACTTTTTATCAAAGTTTAACAGCCTCTTTAATTTATGATTATGATTATTTTACTACTG 
AGCTTTCAGGTGAATTAAAAAATAGTTTTGAAGATATTAAAGCTTCTTCTGAGCTTAAACTT.TCTTTAGAT.TTTCC 
TTATTTGCTACAAGAAGCTGGGATTGGAATTAAATATTATAAAAAGTTTAAAGAAGATGCTATGAAAAACTCTGGA 
ATTTCTGCTGTTCAAAGTCCTTTGGAGCCTCAAAAACCATCATCGCCTTATAAAAATTTAGAAATGTCTCCTGCTT 
TGTATTATAAAATTGAGCCGAGATATTTGGATTATTTTAAATTTAGTTTTTTAGTCGCCTATGATCCTTTGATAAA 
TAGAGTTTCTGAACTTTCTTTTAAGCTTAATGTTTTTGATTTTCAATTTTTGTTTGCTATGAAAGACGACTTTGAA' 

T AT AATT ATGATC C TTT AAAAGG AG ATTTTTC C AAG AT TGGT ACT AC AAC C AAAC TTGTTC C AT ATTC TTT AG ATT 
CTAGTTACAAAAAGGAATTGTACGTTTTAACTTTTTTTGACAATAAGCTTTCTTTTACCTTGGGGGTAGATGTTGG 
TTGGAAAATAAATTTGCAGAAATTTACGGATAATGAACTTCGATCTGCATTGACTTTGAAGTTTAAATATACAGAA 
TTTTTAGAAATTTACTTTTCTACTTTATCTATTAATACTAAGACTTTTAAATATTTTAAAGGGTATATGGACCAAA 
TTGGTCTAGAACCTGTTAATTTCTTTGTTGATTTATCAAAATCTTTCAATTTCTTTAATTCTCAAGACAGAAAAGA 
TTCACTTTTTAAAATTAAAAAATTTTCATCAGGCTTTAAATTCAATTTTTATGATTGGAAATTTGTTGGAGAATAT 
AATTTAGAACCAGATTTATTAAGGGGATCTGATGGGATTTATTCTCCTATTTGGAGAAATAATTTTACAATTTATA 
TTTCTTGGAACTTTTTTGCTCCTATAAAAGCGTCATTTGAAAACAACAAAGATACAAACTACGAGTTTATTATTAA 

TAGAAAAAC AAAAAAATAA 

f 129 .aa . 
MTKKLFVRVL I FL I SNNYAF^ 

NGYRRNFNFFNLLNTSNIIKSLSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDDDFELKLSVTRKNNQIYLIL 
DFNFLFDQRKTFPSIYIKEEDVSTIINSFMKLQDSSFLSPQAS . 

tl29.aa J 

KDTIKDLFFIQDILIKKEKYSEVLNNA5LEGIIEIEHNGPYIKDHDSEVKLILKENGYRRNFNFFNLLNTSNIIKS 

LSLFDSRPKNIKENEIILLETKMIKENPYKRYKDDD^ 

VSTI INSFMKLQDSSFLS PQAS 

fl29.nt 

ATGACAAAAAAATTGTTTGTGAGGGT ATTAATCTTTTT AAT ATCCAAT AATT ATGCTTTTGCAAAAGACACAATCA- 

AAGATTTGTTCTTTATACAAGATATACTAATAAAAAAAGAGAAATATTCCGAGGTTCTAAATAATGCAAGCCTTGA 

AGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACTTATCCTAAAAGAA 

AACGGATATAGAAGAAATTTCAACTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGTCTAAGCTTATTTG 

ACAGCAGACCAAAAAACATTAAAGAAAATGAAATCATATTATTAGAGACAAAAATGATTAAAGAAAATCCCT^ 

ACGATACAAAGACGATGATGATTTTG^TTAAAACTAAGTGTAACTC^ 
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GATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGATGTATCAACAATAA 
TAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

tl29'.nt " 

AAAGAC AC AATC AAAG ATTTGTTCTTTAT AC AAGAT ATAC TAAT Ai^AAAAAGAGAAAT ATT C C GAGGTTC T AAATA 
ATGCAAGCCTTGAAGGCATTATTGAAATTGAACATAACGGACCATACATTAAAGATCACGATTCAGAAGTTAAACT 
' TATCCTAAAAGAAAACGGATATAGAAGAAATTTCAACTTTTTTAATCTTTTAAATACTAGTAATATAATCAAAAGT 
CTAAGCTTATTTGACAGCAGACCAAiU^AACATTAAAGAAAATGAAATCATATTATTAGAGA 
AAAATCCC T AT AAAC GAT AC AAAGAC GATGATGATTTTGAATTAAAACTAAGTGTAACTCGAAAAAA 
TTATTTAATTCTTGATTTCAATTTCCTATTTGATCAAAGAAAAACGTTTCCATCAATTTACATCAAAGAAGAAGAT 
GTATCAACAATAATAAACAGCTTCATGAAACTACAAGATTCAAGCTTTTTATCTCCTCAAGCTTCTTAA 

fl42.aa 

MDKISILYTLINIIIMLILISIVYLCKRKNVSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINWISILGDGY 
VRLLKMI I I PL I ITS I I SAI IKLTNSKDVGKMSLLVILTLVFTAGIAAI IGIFTALALGLTAEGLQAGTIEILQSE 
KLQKGLEILNQTTITKKITDLIPQNIFEDFAGLRKNSTIGWIFSAIIGIAALKTSIKKPES.IEFFKKIILTLQDI 
ILGVVTLILKLTPYAILALMTKITATSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAINKLNPITFIKKIFPALS 
FAFiSRSSAATIPINIEIQTKNLGVSEGIANLSSSFGTSIGQNGCAALHPAMLAIMIAPTQGINPTDISFILTLIG 
LIIITSFGAAGAGGGATTASLMVLSAMNFPVGLVGLVISVEPIIDMGRTAVNVGGSMLAGVISAKQLKQFNHNIYN 
QKELVNK 

tl42.aa v 

CKRKIWSFTKRVFIALAIGIVFGMTIQYFYGTNSEITNETINWISILGDGYVRLLKMIIIPLIITSIISAIIKLT^ 
SKDVGKMSLLVILTLVFTAGIAAIIGIFTALALGLTAEGLQAGTIEILQSEKLQKGLEILNQTTITKKITDLIPQN 
IFEDFAGLRKNSTrGWIFSAIIGIAALKTSIKKPESIEFFKKIILTLQDIILGWTLILKLTPYAILALMTKITA 
TSEIKSIIKLGEFVIASYIAIGLTFLMHMTLIAINKLNPITFIKKIFPALSFAFISRSSAAT'IPINIEIQTKNLGV 
SEGIANLSSSFGTSIGQNGCAALHP.AMLAIMIAPTQGINPTDISFILTLIGLIIITSFGAAGAGGGATTASLMVLS 
AMNFPVGLVGLVI SVE PI IDMGRTAVNVGGSMLAGVI SAKQLKQFNHNI YNQKELVNK . ' 



fl42.nt . . 

TAAGAGGTAATAATGGATAAAATAAGTATATTATATACATTAATCAATATTATAATAATGCTTATTCTAATAAGCA 
TAGTTTATCTTTGTAAA^GAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATT 
TC^AATGACCATTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTG 
GGCGATGGATACGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACA.TCAATAATCTCTGCAATAATAA 
AACTAACCAATAGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGC 
TGCCATAATTGGCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATT 
TTACAAAGTGAAAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACAA 

TTCCACAAAATATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGCTAT 
CATAGGAATAGCCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAATAATATTAACA 
CTCCAAGACATAATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAA 
AAATTACAGCAACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGG 
TCTTACATTTCTTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTC 
CCAGCACTATCATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACCCATTAATATAGAAATTCAAACTAAAA 
ATCTGGGAGTAAGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTGCAGC 
ACTACACCCCGCTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTC 
ACACTTATTGGATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCGCAACAACAGCCTCACTAA 
TGGTGCTCTCAGCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTATAATTGACATGGG 
•AAGAACAGCTGTTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCAACCAT 
AATATATACAACCAAAAAGAGCTTGTAAACAAATAA 
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TGTAAAAGAAAAAATGTTTCTTTTACAAAAAGAGTGTTTATAGCGTTAGCAATCGGAATAGTATTTGGAATGACCA 
TTCAATATTTTTATGGAACAAATTCAGAAATAACAAACGAAACTATAAATTGGATAAGTATTTTGGGCGATGGATA 
CGTAAGGCTCCTTAAAATGATTATAATCCCCTTAATAATAACATCAATAATCTCTGCAATAATAAAACTAACCAAT 
AGTAAAGATGTTGGGAAAATGAGCCTACTTGTAATATTAACACTAGTATTTACAGCAGGTATTGCTGCCATAATTG 
GCATTTTCACTGCTTTAGCATTGGGATTAACAGCCGAAGGACTACAAGCGGGAACCATCGAAATTTTACAAAGTGA 
AAAATTGCAAAAAGGCCTTGAAATATTAAATCAAACAACAATCACA 

ATATTTGAAGATTTTGCAGGGCTTAGAAAAAACTCAACCATCGGGGTCGTGATATTTTCAGC.TATCATAGGAATAG 

CCGCCCTTAAAACATCTATCAAAAAGCCAGAATCAATAGAATTTTTTAAAAAAATAATATTAACACTCCAAGACAT 

AATATTAGGTGTAGTAACTTTGATTTTAAAACTAACGCCTTATGCTATATTAGCTTTAATGACAAAAATTACAGCA 

ACCAGCGAAATCAAAAGCATAATAAAGCTTGGAGAATTTGTAATTGCTTCCTACATTGCCATAGGTCTTACATTTC • 

TTATGCATATGACATTAATTGCAATAAATAAATTAAACCCAATTACTTTTATAAAAAAAATATTCCCAGCACTATC 

ATTTGCATTCATATCTAGGTCGAGTGCTGCAACCATACGCATTAATATAGAAATTCAAACTAAAAATCTGGGAGTA 

AGCGAAGGAATAGCAAATTTATCAAGCTCCTTTGGAACATCAATTGGGCAAAATGGTTGTGCAGCACTACACCCCG 

CTATGCTTGCAATAATGATAGCACCAACTCAGGGAATAAACCCCACAGATATTTCATTTATACTCACACTTATTGG 

ATTAATAATAATAACTTCATTTGGAGCTGCTGGCGCTGGTGGAGGCGCAACAACAGCCTCACTAATGGTGCTCTCA. 

GCAATGAACTTTCCAGTGGGATTGGTAGGACTTGTAATATCTGTTGAGCCTATAATTGACATGGGAAGAACAGCTG 

TTAATGTAGGCGGCTCAATGCTTGCAGGCGTTATATCTGCTAAACAGCTCAAACAATTCAACCATAATATATACAA 

CCAAAAAGAGCTTGTAAACAAATAA 
fl47..aa 

MKIIIIGGTSAGTSAAAKANRLNKKLDITIYEKTNIVSFGTCGLPYFVGGFFDNPNTMISRTQE'EFEKTGISVKTN 
HEVIKVDAKNNTIVIKNQKTGTIFNN^ 

VIIGGGYIGIEMVEAAKNKRKISrVTlLIQLDKHILIDSFDEEIVTIMEEELTKKGVNLHTNE 
NKNTYQAIDAVILATGIKPDTEFLENQLKTT 

LGRIVGENLAGNHTAFKGTLGSASIKILSLEAARTGLTEKDAKKLQIKYKTIFVKDKNHTNYYPGQEDLYIKLIYE 
ENTKIILGAQAIGKNGAVIRIHALSIAIYSKLTTKELGMMDFSYSPPFSRTWDILNIAGNAAK 

.tl47.aa. 

AAAKANRLNKKLDITIYEKTNIVSFGTCGLPYFVG^ 
IKNQKTGTIFNNTYDQLMIATGAKPIIPPINN^ 

AAKNKRKNWLIQLDKHILIDSFDEEIVTIMEEELTKKGVNLHTNEFVKSLIGEKKAEGWTNKNTYQADAVIL^ 
GIKPDTEFLENQLKTTKNGAIIWEYGETSIKNIFSAGDCATIYNIVSKKNEYIPI^TTANKLGRIVGENLAGNHT 
AFKGTLGS AS I KI LS LEAARTGLTEKDAKKLQ I KYKTI FVKDKNHTNYYPGQEDL Y I KL I YEENTKI I LGAQAIGK 
NG A VI R I HAL S I A I Y S KLTTKE LGMMD FSYSPPFS RTWD I LN I AGNAAK 

fl47*ht' 

ATGAAAATAATAATTATTGGGGGCACATCAGCAGGAACTAGTGCCGCAGCTAAAGCAAACCGCTTAAACAAAi^ 

TAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTGGAACCTGTGGCCTGCCTTACTTTGTGGGGGGATT 

CTTTGACAACCCCAATACAATGATCTCAAGAACACAAGAAGAATTCGAAAAAACTGGAATCTCTGTTAAAACTAAC 

CACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTAATAAAAAATCAAAAAACAGGAACCATTTT^ 

ATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTATTATTCCACCAATCAATAATATCAATCTAGAAAA 

TTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAAAAAATTAATGGATAGAGAAGAGATTAAAA 

GTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAAGCAGCAAAAAATAAAAGAAAAAATGTAAG 

TTGAACTAGATAAGCACATACTCATAGATTGCTTTGACGAAGAAATAGTCACAATAATGGAAGAAGAACTAACAAA 

AAAGGGGGTTAATCTTCATACAAJ^TGAGTTTGTAAAAAGTTTAATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACA 

AACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACCGGAATAAAACCTGACACTGAATTTTTAGAAAACC 

AGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAG 

AGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAATGAATACATACCCTTGGCAACAACAGCCAACAAA 
.CTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACAGCATTTAAAGGCACATTGGGCTCAGCTTCAATTA 
AAAT AC TATCTTT AGAAGCTGC AAGAAC AGGAC TTAC AGAAAAAGATGC AAAAAA-GCTCC-^JVAT AAAAT ATAAAAC 
GATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGGCCAAGAAGATCTTTATATTAAATTAATTTATGAG 
GAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAA^TGGAGCCGTAATAAGAATTCATGCTTTATCAA 
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TTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGATGGATTTCTCATATTCCCCACCCTTCTCAAGAAC 
TTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 

tl47.nt 

GCCGCAGCTAAAGCAAACCGCTTAAACAAAAAGCTAGACATTACTATCTATGAAAAAACAAATATTGTATCTTTTG 

GAACCTGTGGCCTGCCTTACTTTGTGGGGGGATTCTTTGAGAACCCCAATACAATGATCTCAAGAACACAAGAAGA . 

ATTCGAAAAAACTGGAATCTCTGTTAAAACTAACCACGAAGTTATCAAAGTAGATGCAAAAAACAATACAATTGTA 

ATAAAAAATCAAAAAACAGGAACCATTTTTAACAATACTTACGATCAACTTATGATAGCAACTGGTGCAAAACCTA 

TTATTCCACCAATCAATAATATCAATCTAGAAAATTTTCATACTCTGAAAAATTTAGAAGACGGTCAAAAAATAAA 

AAAATTAATGGATAGAGAAGAGATTAAAAATATAGTGATAATTGGTGGTGGATACATTGGAATTGAAATGGTAGAA 

GCAGCAAAAAATAAAAGAAAAAATGTAAGATTAATTCAACTAGATAAGCACATACTCATAGATTCCTTTGACGAAG 

AAATAGTCACAATAATGGAAGAAGAACTAACAJVAAAAGGGGGTTAATCTTCATACAAATGAGTTTGTAAAAAGTTT 

AATAGGAGAAAAAAAGGCAGAAGGAGTAGTAACAAACAAAAATACTTATCAAGCTGACGCTGTTATACTTGCTACC. 

GGAATAAAACCTGACACTGAATTTTTAGAAAACCAGCTTAAAACTACTAAAAATGGAGCAATAATTGTAAATGAGT 

ATGGCGAJVACTAGCATAAAAAATATTTTTTCTGCAGGAGATTGTGCAACTATTTATAATATAGTAAGTAAAAAAAA 

TGAATACATACCCTTGGCAACAACAGCCAACAAACTTGGAAGAATAGTTGGTGAAAATTTAGCTGGGAATCATACA 

GCATTTAAAGGCACATTGGGCTCAGCTTCAATTAAAATACTATCTTTAGAAGCTGCAAGAACAGGACTTACAGAAA 

AAGATGCAAAAAAGGTCCAAATAAAATATAAAACGATTTTTGTAAAGGACAAAAATCATACAAATTATTATCCAGG 

CCAAGAAGATCTTTATATTAAATTAATTTATGAGGAAAATACCAAAATAATCCTTGGGGCACAAGCAATAGGAAAA 

AATGGAGCCGTAATAAGAATTCATGCTTTATCAATTGCAATCTATTCAAAACTTACAACAAAAGAGCTAGGGATGA 

TGGATTTCTCATATTCCCCACCCTTCTCAAGAAGTTGGGATATATTAAATATTGCTGGCAATGCTGCCAAATAG 

f!52.aa . - , - 

MLKFEFSDRFLLFSYFVLIMFIGSLLLMLPISWEGDGKLAYIDALFTAVSAVSITGLTTVKMEGFSTFGFILIMLL 
IQLGGLGFISITTFYLLIPKKKMNLTDARIIKQYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVNI 
SFLEALFTTISAFCNAGFSMHSESIYAWRDVPEAIVVVSILIICGGLGFMV^ 

FFLIIIGAILFFFTEMHKLKAGYSMSTLIFNSIFYSISTRTAGFNYLDIMSLISGRTQIISLPFMFIGGAPGSTAGG 
IKITTFFLIVLAWKNQNGNGYIIGSYKVSIDSIRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFS 
' AFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRIGLFSMAVFVSRKSRFEEFTRPRQDILVG 

tl52.aa 

WEGDGKLAYIDALFTAVSAVS ITGLTTVKMEGFSTFGFI LIMLLIQLGGLGF I S ITTFYLL I PKKKMNLTDARI IK 
QYSLSNIEYNPIRILKSILFITFSIEMIGLILILICFKLRGVKISFLEALFTTISAFCNAGFSMHSESIYAWRDVP 

EAIVVVSILIICGGLGFMVYRDVNNTIKNKKKLSL^ 

IFYSISTRTAGFNYLDNSLISGRTQIISLPFMFIGGAPGSTAGGIKITTFFLIVLAWKNQNGNGYIIGSYKVSID 
SXRFALLFFARAIFILSFSFFMLLFFEGGSGNWKVIDLGYEVFSAFGTVGLSVGVTQDLSFWGKVIIIFTMFAGRI 

GLFSMAVFVSRKSRFEEFTRPRQDILVG 

r 

fl52.nt . 

ATGTTGAAATTTGAATTTAGCGACAGGTTTTTACTTTTTAGTTATTTTGTTTTAATTATGTTTATAGGCTCTCTTT 
TGTTGATGTTGCCTATTTCCTGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGC 
TGTAAGTATTACGGGCCTTACAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTTGATAATGTTGCTA 

ATCC AGC TTGGGGG AC TTGGATTT ATAAGT ATT ACT AC TTTTT ATTTGC TTATAC C T AAAAAG AAAATGAATTTAA 
CAGATGCAAGAATAATAAAGCAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATT 
GTTTATAACTTTTTCAATTGAAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATT 
TCATTCTTAGAGGCTTTGTTTACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTT 
ATGCATGGCGAGATGTTCCTGAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGT 
CTATAGAGATGTAAATAACACTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGC 
TTCTTTTTAATTATAATTGGTGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGA 
GCACTTTAATATTTAATTCAATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCTTT 
AATAAGCGGAAGAACTCAAATAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATCAACTGCAGGAGGG 
ATTAAGATTACAACATTTTTTTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATATTATTGGTT 
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CTTACAAGGTTTCAATAGATAGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTC 
TTTTTTCATGCTTCTTTTTTTTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATGAAGTATTTTCT 
GCTTTTGGAACGGTTGGTCTTTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTA 
CTATGTTTGCAGGACGAATAGGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTAC 
AAGGCCAAGGCAAGATATTTTGGTTGGTTGA 

tl52 -nt 

TGGGAAGGTGATGGCAAATTAGCATACATTGATGCTCTTTTTACTGCTGTTTCTGCTGTAAGTATTACGGGCCTTA 
CAACGGTTAAAATGGAAGGCTTTTCTACTTTTGGATTTATTTT.GATAATGTTGCTAATCCAGCTTGGGGGACTTGG 
ATTTATAAGTATTACTACTTTTTATTTGCTTATACCTAAAAAGAAAATGAATTTAACAGATGCAAGAATAATAAAG 
CAGTATTCCCTTTCAAATATAGAATATAATCCTATTAGAATTTTAAAAAGCATATTGTTTATAACTTTTTCAATTG 
AAATGATAGGTTTAATATTAATACTTATTTGTTTTAAACTTAGGGGAGTGAATATTTCATTCTTAGAGGCTTTGTT 
TACGACAATTTCTGCTTTTTGCAATGCAGGTTTTTCCATGCATTCTGAGAGTATTTATGCATGGCGAGATGTTCCT 
GAAGCTATAGTTGTGGTCTCTATTTTAATAATTTGTGGTGGGCTTGGGTTTATGGTSTATAGAGATGTAAATAACA 
CTATTAAAAACAAAAAAAAACTATCGCTTCATGCCAAGATAGTTTTTTCTTTAAGCTTCTTTTTAATTATAATTGG 
TGCAATTTTATTTTTTTTTACAGAGATGCATAAATTAAAAGCTGGTTATTCAATGAGCACTTTAATATTTAATTCA 
ATTTTTTATTCGATTAGTACCAGAACAGCTGGTTTTAATTATCTTGATAATTCTTTAATAAGCGGAAGAACTCAAA 
TAATTTCTCTACCATTCATGTTTATTGGTGGTGCACCCGGATCAACTGCAGGAGGGATTAAGATTACAACATTTTT 
TTTAATTGTATTGGCTGTTGTTAAAAATCAAAACGGCAATGGATATATTATTGGTTCTTACAAGGTTTCAATAGAT 
AGTATAAGATTTGCACTTTTATTTTTTGCAAGAGCTATTTTTATTTTAAGTTTTTCTTTTTTCATGCTTCTTTTTT 
TTGAGGGAGGATCTGGCAATTGGAAGGTTATTGATTTAGGTTATGAAGTATTTTCTGCTTTTGGAACGGTTGGTCT 
TTCAGTTGGAGTAACTCAGGATTTGTCATTTTGGGGGAAAGTCATTATAATTTTTAGTATGTTTGCAGGACGAATA 
GGGCTTTTTTCAATGGCTGTTTTTGTTTCAAGAAAGTCGCGTTTTGAAGAATTTACAAGGCCAAGGCAAGATATTT 
TGGTTGGTTGA 

fl54.aa" 

MKINKTFILLFLFTKFSFVQAQANQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNK 
KYITDKVKRRDFKIRLDKNSLYAIIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVLNLT 
AKLKKYVLRIYKSNIYIAYQLENSDDIKVAEFIEDVGWFNLDSSVNRNITNIVNFDFSINSKGNLYIAFVTKSGAD 
FASELIVXKFNSRKWIDISPGHIENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSK 
GDSNVNSSNIGLISEPFLGIFYNYKSNNEIKSEFI^^ 

PIWICPLKSSRWINISPNVEMEGLSADIGLYKiSTNLFLAFEDNNNVRLIYF 
YGNQGLVISTLSSNSNELFFTLICQ 

tl54.aa 

NQILTEISPLSILSKNGKGSVYLKVSKSSDYILTLDKSSNSDFVFKIYDISNKKYITDKVKRRDFKIRLDKNSLYA 
IIYVGTKNENIKFSLTDLDFSILSSDSLKAKTSKIEKEDLFFTLKDLPVL.NLTAKLKKYVLRIYKSNIYIAYQLEN 

SDDIKVAEFIEDVGWFNLDSSVNFJNITNIVNFDF^^ 

ENFGSLLNISIDLKDRLYLAYLREIRGEYKINLISNMGYGSIWTDVIHAYLSKGDSNVNSSNIGLISEPFLGIFYN 

YKSNNEIKSEFIVNNENAWWANIPSVYMANFIKGFFDSNFNQIIMSFVSENRPIW 

GLSADIGLYKNNLFLAFEDNNI^T^LIYFKNKNWYFLNKLE 

CQ 

fl54 t nt 



ATGAAAATAAATAAGACATTCATTTTGCTATTTTTATTTACAAAATTTTCTTTTGTTCAAGCTCAAGCAAATCAAA 
TATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAGTTAGCAAATC 
TTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATTTCTAATAAA 
AAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCAATAATAT 
ATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATTTTTCAATTTTAAGTAGCGATTCCCT 
GAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTGTTTTAAATTTAACT 
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GCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAATAGCGATG 
ATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGAAATATTACTAA 
T AT AGTT AATTTTG ATTTTTC AATTAATTC TAAAGGAAATTT AT ATATTGC TTTTGTT ACG AAATC AGGGGC TGAT 
TTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATAGAAAATT 
TTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTAGGGGTGA 
ATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTTAAGTAAA 
GGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAATTATAAGT 
CAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTAAATGCAAATATTCCTTCTGTTTA 
TATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGAAAATAGA 
CCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGGAAGGTTTAA 
GTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAATTTATTT 
TAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGATTGGAATT 
TATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATTTGCCAAT 

GA 

tl54.nt • 



AATCAAATATTAACAGAAATTAGTCCTTTAAGTATTTTAAGCAAAAATGGGAAAGGAAGTGTTTACTTAAAAG 
GCAAATCTTCCGATTATATTTTAACCCTAGATAAGAGTTCAAATTCCGATTTTGTTTTTAAAATTTATGACATTTC 
TAATAAAAAATATATAACCGATAAAGTAAAAAGAAGAGATTTTAAAATAAGATTAGATAAAAATTCTCTTTATGCA 
ATAATATATGTTGGTACTAAAAATGAAAACATAAAGTTTTCGCTTACAGATTTAGATTTTTCAATTTTAAGTAGCG 
ATTCCCTGAAAGCTAAAACATCTAAGATTGAAAAAGAAGATTTATTTTTTACTTTAAAAGATTTGCCTGTTTTAAA 
TTTAACTGCCAAGCTTAAAAAATATGTATTAAGGATTTATAAAAGCAATATTTATATTGCTTATCAGCTAGAAAAT 
AGCGATGATATTAAAGTTGCTGAATTTATTGAGGATGTTGGTTGGTTTAATCTTGATTCATCTGTTAATAGAAATA 
TTACTAATATAGTTAATTTTGATTTTTCAATTAATTCTAAAGGAAATTTATATATTGCTTTTGTTACGAAATCAGG 
GGCTGATTTTGCCAGCGAGCTTATAGTTAAAAAATTTAATAGTAGAAAATGGATTGATATTAGTCCTGGTCACATA 
GAAAATTTTGGATCTTTATTAAATATTAGCATTGATTTAAAAGATAGGTTGTATTTAGCATATTTAAGGGAAATTA 
* GGGGTGAATATAAAATTAATTTAATCTCGAATATGGGTTACGGAAGTATTTGGACCGATGTAATACATGCTTATTT 
AAGTAAAGGTGATTCTAATGTTAATTCATCAAACATTGGTTTAATATCTGAACCTTTTTTGGGCATTTTTTATAAT 
TATAAGTCAAATAATGAGATTAAATCTGAATTTATTGTAAACAATGAAAATGCTTGGGTAAATGCAAATATTCCTT 
CTGTTTATATGGCCAATTTTATTAAAGGCTTTTTTGATTCTAATTTTAATCAAATAATTATGAGTTTTGTTTCTGA 
AAATAGACCTATTGTAAACATTTGTCCTTTGAAAAGTAGTAGATGGATTAATATAAGTCCTAATGTTGAAATGGAA 
GGTTTAAGTGCTGACATTGGGCTTTATAAAAATAATTTGTTTTTAGCTTTTGAGGACAATAATAATGTGAGATTAA 
TTTATTTTAAGAATAAAAATTGGTATTTTTTAAATAAGCTTGAGAATTTTAAGAGTAATGTTAAAAGCCCTCAGAT 
TGGAATTTATGGCAATCAAGGGCTTGTAATCTCTACTTTAAGCTCTAATTCCAATGAATTATTTTTTACTTTGATT 

TGCCAATGA 



£157 .aa 



MKIFLKVIGRGILGRIjMVFRKNYDYLIaIjISLLIV^ igfflifivg 
KYDLKFVYSMVYPLYFLLILALIFTAFFGMTVNGARSWIGIWKLGGQPSSFGKWIILTLSKFYTEKKGYNEFFTF 

ITAFLLIFPSVILILLQPDFGTAIVYLTIFIFISFFAGIDLH^ 

IFSNPFYFRVIMGVLLLILLISVLGFFISKYGLSIKIIYFWFFASSILLVSIVFSKVLSKLMKTYQIKRFLVFLD 
PAIDAKGAGWNLNQVKIAIGSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFL 
IIMNKSQDRYMALVISGILGLLFFHTSFNVGMSLGVLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



tl57.aa 

RKNYDYLALISLLIVSFVGILLIYSSDYNISGSLTKNEYIKQTFWVIIGFFLIFIVGKYDLKFVYSMVYPL 
I^LIFTAFFGMTV>IGARSWIGIWKLGGQPSEFGKVVIILTLSK-FYTEKKGYNEFFTFITAFLLIFPSVILILLQPD 

FGTAIVYLTIFIFISFFAGIDLHYVLAF ALIGFFSFVFAILPVWYEYKVNM 



s 
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LISVLGFFISKYGLSIKIIYFYVFFASSILLVSIVFSKVLSKLMKTYQIXRFLVFLDPAIDAKGAGWNLNQVKIAI 
GSGGLLGKGFLKGPYTHANYVPSQSTDFIFSILAEEFGFLGVSTILILFFFLFFKFLIIMNKSQDRYMALVISGIL 
GLLFFHTSFNVGMSLGVLPITGIPFPFLSYGGSSTITFFLAMSFYFNIESIVAMD 



f 157 .nt 



"ATGAAGATATTCTTAAAGGTTATAGGCCGTGGTATATTAGGTAGATTAATGGTTTTTAGAAAAAATTATGATTATT 
TGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTAGCGATTATAATATTAGTGG 
ATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTTTCTAATTTTTATAGTGGGC 
"AAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATATTGGCTTTAATTTTTACTG 
CATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTGGAGGACAGCCTTCTGAATT 
TGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTATAATGAATTTTTTACCTTT 
ATTACTGCATTTTTATTAATTTTTCCATCGGTAAT'TCTTATATTATTGCAACCTGATTTTGGTACAGCAATAGTAT 
ATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTTTAGCATTTGCGTTGATAGG 
GTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATATGGGTAATGTATTTTATCTT 
ATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTTAATTCTTTTGATTTCTGTTTTAGGAT 
TTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTGCAAGTTCTATTTTATTAGT 
TTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATGAGATTAAACGGTTTTTGGTATTCTTAGAT 
CCGGCTATTGATGCTAAGGGTGCTGGTTGGAATTTAAATCAGGTTAAAATAGCAATTGGTTCTGGCGGTCTTTTGG 
GCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCACAGATTTTATTTTTTCTAT 
TCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTTCCTTTTTTTTAAATTTTTG 
ATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTGGGACTTTTATTTTTTCATA 
CTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTCCTTTTCTCTCTTATGGAGG 
TTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAATAGTTGCTATGGATTGA 



tl57 .nt 

AGAAAAAATTATGATTATTTGGCTTTGATAAGCTTACTTATAGTTTCTTTTGTTGGTATATTGTTGATTTATTCTA 
GCGATTATAATATTAGTGGATCTTTAACCAAGAATGAATATATAAAACAAACCTTTTGGGTAATTATTGGATTTTT 
TCTAATTTTTATAGTGGGCAAATATGATTTAAAATTTGTTTATAGCATGGTATATCCTTTATATTTTTTATTAATA 
TTC^CTTTAATTTTTACTGCATTTTTTGGAATGACAGTAAATGGAGCAAGATCTTGGATTGGCATATGGAAACTTG 
GAGGACAGCCTTCTGAATTTGGTAAAGTTGTTATTATTTTAACCCTTTCAAAATTTTACACTGAAAAAAAGGGTTA 
TAATGAATTTTTTACCTT-TATTACTGCATTTTTATTAATTTTTCCATCGGTAATTCTTATATTATTGCAACCTGAT 
TTTGGTACAGCAATAGTATATTTAACCATTTTTATATTTATTTCTTTTTTTGCAGGAATAGATTTGCACTATGTTT 
TAGCATTTGCGTTGATAGGGTTTTTTTCTTTTGTTTTTGCAATTTTACCGGTTTGGTATGAATATAAGGTGAATAT 
GGGTAATGTATTTTATCTTATTTTCTCAAATCCTTTTTATTTTAGAGTAATAATGGGAGTGCTGCTTTTAATTCTT 
TTGATTTCTGTTTTAGGATTTTTCATTTCTAAATATGGTTTGAGTATTAAAATAATTTATTTTTATGTATTTTTTG 
CAAGTTCTATTTTATTAGTTTCAATAGTGTTTTCAAAGGTTCTTTCAAAGTTAATGAAGACTTATCAGATTAAACG 
■GTTTTTGGTATTCTTAGATCCGGCTATTGATGGTAAGGGTGCTGGTTGGAATTTAAATCAGGTTAAAATAGCAATT 
GGTTCTGGCGGTCTTTTGGGCAAAGGATTTTTAAAGGGACCTTATACCCACGCTAATTATGTGCCATCTCAAAGCA 
CAGATTTTATTTTTTCTATTCTTGCCGAAGAGTTTGGGTTTTTGGGTGTTAGCACTATTTTAATATTATTTTTTTT 
CCTTTTTTTTAAATTTTTGATAATAATGAATAAAAGTCAAGATAGATATATGGCCTTAGTAATATCTGGAATTTTG 
GGACTTTTATTTTTTGATACTTCTTTTAATGTTGGAATGTCTTTAGGAGTTCTTCCTATTACCGGGATTCCCTTTC 
CTTTTCTCTCTTATGGAGGTTCTTCTACTATTACATTTTTTTTAGCAATGTCTTTTTATTTTAATATTGAATCAAT 
AGTTGCTATGGATTGA 



f 17 .aa 



MI-VFLFFSIYLIILFKRSSNSPLYFVPDTKFETLSIRIVIiSCSLLLIFFCTMLDARPSTIAVFPTPGSPISIALFL 
FLLKSIFVRVLISASLPTKGSNFLAFASAVKFLTYFPISKCSFSSRISSSNSL 
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tl7 :aa 



PLYFVPDTKFETLSIRIVLSCSLLLIFFCTMLDARPSTIAVFPTPGSPISIALFLFLLKSIFVRVLISASLPTKGS 
NFLAFASAVKFLTYFPISKCSFSSRISSSNSL 

fl7.nt 

ATGATTGTGTTTTTGTTTTTTTCAATATACTTAATTATATTATTTAAACGATCTTCAAACTCGCCTCTATATTTTG 
TTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTATTTTTTTTTGCAC 
TATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCATTGCACTATTTTTA 
TTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTGTTCCAACCAAGGGGTCTAATTTTTTGGCTT 
TTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTATTTCTTCATCAAA 

TTCTTTGTAG - 



tl7 .nt 



CCTGTATATTTTGTTCCCGATACCAAGTTTGAAACCTTAAGCATTAGAATTGTTTTGTCTTGTAGTTTGCTACTTA 
TTTTTTTTTGCACTATGCTTGATGCAAGGCCTTCAACTATTGCTGTTTTTCCCACACCAGGTTCGCCTATTAGCAT 
TGCACTATTTTTATTTCTTCTCAAGAGTATATTTGTAAGAGTTTTAATCTCTGCTTCTCTTCCAACCAAGGGGTCT 
AATTTTTTGGCTTTTGCAAGTGCTGTTAAATTTTTGACATACTTTCCAATTTCAAAGTGCTCATTTTCAAGTCGTA 

TTTCTTCATCAAATTCTTTGTAG 



f 170 .aa 



MKAFKVKNLRRFSNFIRILVIVLFLNSLLSLFVFLAGSYNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLK 
NKKIKYYLILIFSFIIFFIALVFKIFLSGNK 



tl7 0 .aa 



YNIFVYNFQKFYLDLAIILSSVSFGLESTRLIFFYFLKMKKIKYYLILrFSFIIFFIALVFKIFLSGNK 



f 170 .nt 



ATGAAAGCTTTTAAAGTAAAAAATCTAAGACGTTTTTCAAATTTTATTAGAATTTTGGTTATTGTATTGTTTTTAA 
ATTCTTTGTTAAGTTTGTTCGTGTTTTTGGCTGGTTCTTACAATATTTTTGTTTACAATTTTCAGAAATTTTATCT 
TGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGACTTGAATCTACTAGACTGATATTTTTTTATTTTTTGAAA 
AATAAAAAAATTAAGTATTATTTAATTTTAATTTTTAGTTTTATAATTTTTTTTATTGCTCTTGTTTTTAAAATTT 

TTCTTTCTGGTAATAA ... 
ATAG. ' - 



tl70 -nt 
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AC>_-.~T^TCAGAAA~TTTATCTTGATCTTGCTATTATTTTAAGCTCTGTTTCTTTTGGAC 
GA/TATCTTTTTA-TTTTTTGAJ^^ 

A-TT'GC TC TCGTTTTr^-AAATTTTTCTTTCTGGT AATAAATAG 



f 135 . aa 



~ac^tlst:'Z-l<il~:<zdi^^^ 

^sr-jltrz z la3gi zllvrvvnawlns pshkeal intdtdkiggyrlkttdnidifwlfgkrk 



- tlSa.aa 



tmhxidt:<ed^^ 

re i las g i z liltf iawln_ ? s :^<z>jli^rtqtdkiggyrlkttdnidifv^fgkrkykn- 



£15 5 .nt 



AT G AAA_AA_-.TTGA, TTA-TA.A "CT'TT AC AC TGT TTTT AT C TC AAGC ATG C AAT T T AAG T AC AAT GC AT AAAAT AGAT A 
C AAA AGAAGAjT AjTGAAJ_ATTC T A.T ATT C A^ TG AATTG AGAAAAAAATT AAATCT AAAC C ATC T AGAAAT 

AGATGATACCC^-GAA^^^ 

TTTGGC ACAACC C CAA.TGCAA_AGAA.TAC ATAJLAT AC G ATC AATCCTTTAATTTAAC AAG AGAAAT ACTGGC ATC AG 
G £ A.'TTG AA.C T T AA.CAGAG TAG T"T AA.TGC ATGGC TT AAT AGT C C AAG C C AC AAAG AAG C TC T T ATT AAT AC AG AT AC 
CGATAAAA.AGGTGGCTA^ 
TATAAGAATTGA 



tlSo .r,t 



ACAJ\TGCA?AA^ 

A.TCTAA ACC ATC T AGAAAT AGA.TGATA.C C CTTG AAAAAGTTGC AAAAGAATATGC C ATTAAACTGGGAGAAAAT AG 
AAC AATAACTCACACCCTTTC^^^ 

A-GAGAA A.T ACTOGCATC A.GGAA-TTGaIaC TT AAC AG AGT C AAGC C AC AAAGAAG 

ctcttatta_ata.caga_ta.c cga.taaaata.ggtggctatagattaaaaacgactgacaatatagatatatttgtagt 
tctttttggaaaaa.g--aaatataagaajttga 



f 1S6 .aa 



MKLKA_?24L1LVL I LZAF F 1 3 1 L-F r AFGML INSKLVDQQFNLMINL I ES IKS S FNL YI SSME EKVRVS SMYFNS AEK ^ 

FNZASKIXSXRJLSFrSDQSEIL 

LA_D?GSIZZS:J_:<3_SXi:<:r2C<GQ^ 

TTGRL L VrZ-i^/VL PGD I LTD I S A.S Y _ N I X KKT S EDL LQ KNK E I S TVYYYD P K SNKK YVG I S Q KVL LN L SNNKF I L LM 
RTSEEDFYYMSRATTI I LAI 53^/rvLLMLAIATLYLVKKL^ 

YEG L E Q LRTNZ S SY AZGV I I2vLD YL Y EN A I Q I A-NA S QNL S S G A VEQ AS T LEQMT AN I EQ I S QGV S ENT ENAATT EK 
I A vl^jTNEF/TKZGrZZS^ ~/]CAI EA_MT"/ITEKIG I IDE ITRQTNLLALNAS I EAARVGEKGKGF EWAAEVRKLADQSK 
ESAJRiiZID-A^rR-LrVASFAGZNF 
SAMS Z__ __ Z S VXI LXZ S VTYFKI EK 



TACAA~ATT r 
TTGAA" 
TTTTATAA.TTTTTTTT 



MXKLiriFTLFLS 

.fgtt?hqrl:-:3tyl 

Y30I . 
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tl96 . aa 



MLINSKLVDQQFNLMINLIESIKSSFNLYISSMESKVRVSSMYFNSAEKFNEASKIKSKRLSFISDQSEILIQTGS 
NMMVTDKEGKIVFTTAVKDNSDFGKSI^ 

ILIGMPLRDFETDNIFGYFMFLYSMDYIYRSFRGINFGILSSGRALAYDTTGRLLVHHWLPGDILTDISASYSNI 
IKKTSEDLLQKNKEISTVYYYDPKSNKKYVGISQKVLLNLSNNKFILLMRTSEDDFYYMSRATTIILAISFVFTLL 
MLAIATLYLVKKLSSSLNKILEYSERLASGNFTADINFGKWDTVELYSLYEGLEQLRTNFSSVAKGVIENLDYLYE 
NAIQIAJSIASQNLSSGAVEQASTLEQMTANIEQISQGVSENTENAATTEKIAVNTNERTKEGHKSVVKAIEAMTVIT 
EKIGIIDEITRQTNLLALNASIEAARVGEKGKGFEVVAAEVRKLADQSKESAREIIDIANRSLTVASRAGENFEQI 
VPGMEQTARLVKNISNESYKQSVQIEQFKNAIEQVSQLVQTTASSSEELSAMSEKMLESVKDLKESVDYFKIEK 



fl96.nt 

ATGAAGCTTAAAGCTAGGATGTTGCTACTTGTTCTTATTCTGATAGCATTCTTTATATCAATTTTGTTTTTTGCTT 
TTGGAATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAG . 
TTCTTTTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAA 
TTTAATGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCG 
GTAGTAATATGATGGTTAGAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGG 
CAAAT.CTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATG 
TTGGCAGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAGATAAAAAATAAAAAAGGTCAGATTC 
CTTACATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTC 
AATGGATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATGAT 
ACTACGGGTAGATTGTTGGTTCATCATGTAGTATTGCCAGGTGATATTTTGACTGATATTAGTGCTTCTTATTCCA 
ATATTATTAAGAAAACATCTGAAGATTTGTTGCAAAAGAATAAAGAAATTTCAACTGTTTATTATTATGATCCTAA 
AAGCAATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTCTAATAATAAATTTATTCTTTTAATG . 
AGAACTTCAGAGGACGATTTTTATTACATGTCAGGAGCTACAACTATAATCTTAGCAATTAGTTTTGTATTTACAT 
TACTTATGCTTGCTATTGCAACTCTTTATCTTGTGAAAAAGTTAAGCTCTTCTTTGAATAAGATACTGGAATATTC 
TGAGAGACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCAAATGGGATACTGTAGAG'CTTTACAGTTTG 
TACGAAGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTTATTGAAAATCTAGATTATCTTT 
ATGAAAATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTGAGCAGGCTTCTACTTTAGA 
GCAAATGACAGCAAATATTGAGCAi^TTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAA 
ATTGCTGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTTAAGGCTATTGAGGCAATGACTGTAA 
TTACTGAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGCTTGCTTTAAATGCCTCGATTGAAGG 
TGCACGAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAGCTTGCAGATCAAAGCAAA 
GAATCAGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAAGTCGTGCTGGGGAAAATTTTGAAC 
AAATAGTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAGTTATAAGCAAAGTGTTCA 
' AATAGAGCAATTTAAAAATGCAATAGAGCAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGAGCTT 
. TCTGCAATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTGATTATTTTAAGATCGAAAAGT 

AA 

tl96.nt 



ATGCTTATTAATAGTAAATTGGTGGATCAACAGTTTAATCTTATGATAAATCTTATTGAAAGCATTAAAAGTTCTT 
TTAATCTTTACATCTCTTCAATGGAAGAGAAAGTTAGGGTTAGTTCCATGTATTTCAACTCTGCTGAAAAATTTAA 
TGAGGCTAGTAAAATTAAATCCAAAAGGTTGAGCTTTATTTCAGATCAATCTGAAATTCTTATTCAAACCGGTAGT 
AATATGATGGTTACAGACAAAGAAGGTAAAATAGTGTTTACTACGGCGGTTAAGGATAATAGTGATTTTGGCAAAT 
CTATTGGGGATAGAGAATATTTTACAAAACTTAAGGAGTCTAATAGTATTGTTTACAATTCCTTTGTCATGTTGGC 
AGATCCCGGGTCTATTGAGGAGTCTTTACTTAAAGATATTTCCAAGATAAAAAATAAAAAAGGTCAGATTCCTTAC 
ATATTAATAGGTATGCCATTAAGAGATTTTGAAACAGATAACATTTTTGGTTATTTTATGTTTCTTTATTCAATGG 
ATTATATATATAGGTCTTTTAGAGGGATTAATTTTGGAATACTCTCTAGCGGTCGTGCGCTAGCTTATGATACTAC 
GGGTA.GATTGTTGGTTCATCATGTAGTATTGCCAGGTGATATTTTGACTGATATTAGTGCTTCTTATTCCAATATT 
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ATTAAGAAAACATCTGAAGATTTGTTGCAAAAGAATAAAGAAATTTCAACTGTTTATTATTATGATCCTAAAAGCA 
ATAAGAAATATGTGGGAATTAGTCAAAAGGTGTTATTAAACTTGTCTAATAATAAATTTATTCTTTTAATGAGAAC 
TTCAGAGGACGATTTTTATTACATGTCACGAGCTACAACTATAATCTTAGCAATTAGTTTTGTATTTACATTACTT 
ATGCTTGCTATTGCAACTCTTTATCTTGTGAAAAAGTTAAGCTCTTCTTTGAATAAGATACTGGAATATTCTGAGA 
GACTTGCTTCTGGTAATTTTACTGCTGATATTAATTTTGGCAAATGGGATACTGTAGAGCTTTACAGTTTGTACGA 
AGGGCTTGAGCAGTTGAGAACCAATTTTTCTTCAGTTGCAAAAGGAGTTATTGAAAATCTAGATTATCTTTATGAA 
AATGCAATTCAAATAGCAAATGCAAGCCAGAATTTAAGTTCTGGCGCTGTTGAGCAGGCTTCTACTTTAGAGCAAA 
TGACAGCAAATATTGAGCAAATTTCACAAGGTGTTTCTGAGAATACTGAAAATGCAGCTACTACTGAAAAAATTGC 
TGTTAATACTAATGAAAGGACTAAAGAGGGGCATAAATCTGTTGTTAAGGCTATTGAGGCAATGACTGTAATTACT 
GAAAAAATTGGAATTATTGATGAGATAACAAGGCAAACCAATTTGCTTGCTTTAAATGCCTCGATTGAAGCTGCAC 
GAGTGGGAGAAAAGGGCAAGGGATTTGAAGTGGTAGCTGCTGAGGTTAGAAAGCTTGCAGATCAAAGCAAAGAATC 
AGCAAGAGAGATTATTGATATTGCAAACAGAAGTTTAACTGTTGCAAGTCGTGCTGGGGAAAATTTTGAACAAATA 
GTTCCTGGTATGGAACAAACAGCCAGACTTGTAAAAAATATTTCTAATGAAAGTTATAAGCAAAGTGTTCAAATAG 
AGCAATTTAAAAATGCAATAGAGGAGGTTAGTCAGTTAGTCCAAACTACAGCCTCAAGCAGTGAAGAGCTTTCTGC 
AATGTCTGAAAAGATGTTAGAGAGTGTAAAAGATTTAAAAGAATCTGTTGATTATTTTAAGATCGAAAAGTAA 



f899.aa 



MRFIXAFLMILNQGFSNLFSLPPEDIIFESSYEVAIKKAQKLNKNVLILVGRDIKENLIKDFLNSFTNGEIIHKVS 
RKSVFLVIDKDNEIFNKINLQKSPTIFFVDSKNEQIKAAYVGAVLSSVQFDKDFLNYVMGAIKSTSVLKKQKDYEI 
NTADERTFFYKTLKGDWRLKFNGKDRKLVLFDTDLKEFLVFKDINENKLYAIPKSRIGNIYFSLLGNEEWKLFGKI 

K 



t899 .aa 
f 899 .nt 



ATGAGATTTATAATTGCATTTTTAATGATTTTAAATCAAGGATTTTCAAATTTGTTTTCTTTGCCTCCGGAAGATA 
TTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATGTTTTAATTTTGGTTGG 
TAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGAAATTATTCACAAAGTATCT 

AGAAAAAGTGTTTTTTTAGTTATTdATAAGGATAATC 

TTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTATGTGGGAGCTGTTTTGAGCAGTGTTCAATTTGA 
TAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCAAAAAGATTATGAAATT 
AATACTGCTGATGAGAGAACCT.TTTTTTACAAAACATTAAAAGGTGATTGGCGATTAAAGTTTAATGGTAAAGAGA 
GA^GCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATGAAAACAAGCTTTATGC 
TATTCCTAAGTCTAGGATTGGTAATATTTATTTTTCATTATTGGGAAATGAAGAATGGAAGCTTTTTGGAAAAATA 

AAATAA 



t899 .nt 



TTGCCTCCGGAAGATATTATTTTTGAGAGTTCTTATGAGGTTGCAATTAAAAAAGCTCAAAAATTGAATAAAAATC 
TTTTAATTTTGGTTGGTAGAGATATTAAAGAAAATTTAATAAAAGATTTTTTAAACTCTTTTACAAATGGTGAAAT 
TATTCACAAAGTATCTAGAAAAAGTGTTTTTTTAGTTATTGATAAGGATAATGAAATTTTTAATAAAATTAATCTA 
CAAAAAAGTCCGACTATTTTTTTTGTTGATTCTAAGAATGAGCAAATAAAGGCAGCTTA.TGTGGGAGCTGTTTTGA 
GCAGTGTTCAATTTGATAAGGATTTTTTAAACTATGTTATGGGAGCTATAAAATCAACAAGTGTTTTAAAAAAGCA 
AAAAGATTATGAAATTAATACTGCTGATGAGAGAACCTTTTTTTACAAAACATTAAAAGGTGATTGGCGATTAAAG 
TTTAATGGTAAAGACAGAAAGCTTGTTCTTTTTGATACAGATCTTAAAGAATTTTTAGTTTTTAAAGATATTAATG 
AAAACAAGCTTTATGCTATTCCTAJ^GTCTAGGATTGGTAATATTTA.TTTTTCATTATTGGGAAATGAAGAATGGAA 

GCTTTTTGGAAAAATAAAATAA 
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f924.aa 

MQDRKFSFRKYFLISVFLIFIVSGITYFYSTQMLEKSQKCVEDNLDAKVKLVDMEDFYFDLNECLNMDDFFIPRPD 
FLNENLNKNLWDGLIKNKFLDENFFKDLWIKKENLFNVDIEKENEKLIDKILEISK 



t924.aa 

TQMUEKSQKCVEDNLDAKVTCLVDMEDFYFDLNEC 
IKKENLFNVDIEKENEKLIDKILEISK - 

f924.nt 



ATGCAAGATAGAAAGTTTAGTTTTAGAAAATATTTTTTAATTTCAGTATTTTTGATTTTTATTGTTTCTGGTATTA 
CTTATTTCTATTCAACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATT 
AGTTGATATGGAAGATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGAT 
TTTTTAAATGAAAATTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTT 
TCAAGGATCTTTGGATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAA 

GATTTTAGAAATTTCCAAATGA 



t924 .nt 



ACACAAATGTTGGAAAAATCTCAAAAGTGTGTTGAAGACAATTTAGACGCTAAGGTTAAATTAGTTGATATGGAAG 
ATTTTTATTTTGATTTAAATGAATGTCTAAATATGGATGATTTTTTTATTCCAAGACCTGATTTTTTAAATGAAAA 
TTTAAATAAGAATTTAGTTGTTGATGGATTGATTAAAAATAAATTTCTTGATGAGAATTTTTTCAAGGATCTTTGG 
ATTAAAAAGGAAAATTTATTTAACGTTGATATTGAGAAGGAGAATGAAAAATTAATAGATAAGATTTTAGAAATTT 

CCAAATGA ' 

f 925 .aa . * _ 

MIRKYLXYISLLFIVFEVYSKPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGE - 

SYDNGAVFTFQTFKKEGKIKLVFTYQNVKD^ 

ISGATSKEIIVRAlilLSYI^YKGAIDL^^ 

RLAIELNIKEEVIjENARYLVEKNVDFSESIYLEIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYE 
SESKHKDFLKALHYYKLVIDNYPFSYYYERAKIRYLFLKRFF 

t925 .aa ' ' 

KPAFISQDDSYELDFSSGEVDISVNTNSKFNLSFKDESWIYIKSIENEAFIKLIGESYDNGAVFTFQTFKKEGKIK 
LVFTYQISTV1CDSSEFNKIIILKITKNFEVAIPQGVGGGSSRDNNIETGNNLELGGGSISG 

DYKGAIDLLNKYNFNDDKYILLKAEIHYKNGDYLKSYENYLKLKSKYFQSXVFDLIRLAIELNIKEEVLENARYLV 
EKNVDFSESIYLEIFEFLVTRGEHEFALNFSSLYFPKYINSSFSDKYSYLLGKLYESESKHKDFLKALHYYKLVID 

NYPFSYYYERAKIRYLFLKRFF 
f925.nt 

ATGATTAGAAAATATTTGATTTATATAAGTTTGCTATTTATTGTTTTTGAAGTTTACTCTAAGCCAGCTTTTATAA 
GTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATACCAATTCAAAATTTAA 
TCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATGAAGCTTTTATTAAGTTAATTGGAGAA 
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TCTTATGATAACGGTGCTGTTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAATTGGTTTTCACTTATC 
AAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATTTTGAAGTTGCAATTCC 
ACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGAACTTGGGGGGGGGAGT 
ATTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAATGATTACAAAGGAGCAA 
TAGATTTGCTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAATTCATTATAAAAATGG 
TGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCATTGTTTTTGATCTAATT 
AGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTTGAAAAGAATGTTGATT 
TTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGGGGAGAGCATGAGTTTGCTTTAAATTTTAG 
CTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTTGGGAAAACTTTATGAG 
TCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGATAATTACCCTTTTAGTT 
ATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 

t925.nt 

AAGCCAGCTTTTATAAGTCAAGACGATTCGTATGAGCTTGATTTTAGTAGTGGAGAGGTAGATATTAGTGTAAATA 
CCAATTCAAAATTTAATCTTTCTTTTAAAGATGAGTCTTGGATTTATATCAAAAGCATTGAAAATGAAGCTTTTAT 
TAAGTTAATTGGAGAATCTTATGATAACGGTGCTGTTTTTACTTTTCAGACTTTTAAAAAAGAAGGCAAAATTAAA 
TTGGTTTTCACTTATCAAAATGTTAAAGATTCAAGTGAATTTAATAAAATAATTATCTTGAAAATTACAAAGAATT 
TTGAAGTTGCAATTCCACAAGGCGTTGGTGGTGGCTCTAGCAGGGACAATAACATTGAAACTGGTAATAATCTTGA 
ACTTGGGGGGGGGAGTAtTAGCGGGGCAACTTCTAAAGAGATTATTGTTAGGGCTTTAAATTTGTCCTACATAAAT 
GATTACAAAGGAGCAATAGATTTGGTTAATAAGTATAATTTCAATGACGATAAATATATTTTATTGAAGGCGGAAA 
TTCATTATAAAAATGGTGATTATTTAAAATCTTATGAAAATTATTTGAAATTGAAGAGTAAATATTTTCAAAGCAT 
TGTTTTTGATCTAATTAGGCTTGCTATAGAATTAAATATTAAAGAAGAGGTTTTAGAGAACGCTAGATATTTAGTT 
GAAAAGAATGTTGATTTTTCTGAGAGCATTTATCTTGAGATCTTTGAATTCTTAGTAACAAGGGGAGAGCATGAGT 
TTGCTTTAAATTTTAGCTCTCTTTACTTTCCTAAGTATATTAATTCAAGCTTTTCAGACAAATATAGTTATTTGTT 
GGGAAAACTTTATGAGTCTGAGAGCAAGCATAAAGATTTTTTAAAGGCTTTGCATTACTATAAATTGGTTATTGAT 
AATTACCCTTTTAGTTATTATTATGAGAGAGCCAAGATAAGATATTTATTTTTAAAGCGGTTTTTTTAG 



f929 .aa 



MTKVLWSAIALLSKDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDVKTSSRIDNPNSNVLEVNKMEDFFGD 
IIDLKGYKILSVQQENLNLDWFEQWLAQNFSNLNAYLFIIGFDPKIKAGTILFKTQIDIDPK^SYNMYLEDITG 
D YDFNI VIQGFLKDKS VL YVFQKSVLNDVS S YRP I F FDKVNGTVLINKYARS S AYEENRSRES YP I S LEKYEKVGE 
DLIISKIEKYEYSNVQGRYCLSSVSEKVGKIDNNIYKT^ 

FERQSSEINLFRKNSQEVAKIEYISKPAYNTLNVSAKSLFSDLIVYNFWIKIVDKENIEIKIDTSTNSYDNSGFSG 
TFKRFDENVLNVKKGSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYGXFNIFPLKNNFVLEYEIDMGSYKL 
VESFFLEHSERIVQKQKFSTIILNPIKILKDD^SLVKGQKLKLER'ISKI 



t92 9 .aa 



KDKELIPFYKFLFLFFFFTLLACSKVSKDFIVFNKDV^^ 
ENLNLDVYFEQVVLAQNFSNLNAYLFIIGFDPKIKAG 

KSVLYVFQKSVLNDVSSYRPIFFDKVNGTVLINKYARSSAYEENRSRESYPISLEKYEKVGEDLIISKIEKYEYSN 

VQGRYCLSSVSEKVGKIDNNIYKTLKNLSKDEVYKFLKGVT^TYDVKDYNKMHVKDID 

SQEVAKIEYISKPAYNTLiWSAKSLFSDLIVYNFWIKIW 

GSSDIYFIPSGNYVYKDKIYDFSYPHLTYIDENKIYYGIFNIFPLKNNFVLEYEIDMGSYKLVESFFLEHSERIVQ 
KQKFSTIILNPIKILKDDVSLVKGQKLKLERIEKI ' " 



f 929 .nt 



72 

TABLE 1 . Nucleotide and Amino Acid Sequences 

ATGACAAAGGTTTTGGTTGTTAGTGCGATTGCTCTTCTGAGTAAGGATAAAGAATTAATCCCATTTTATAAATTTT 

TGTTTTTATTCTTTTTTTTTACATTACTTGCTTGTTCCAAGGTAAGCAAAGATTTTATTGTTTTTAACAAAGA.TGT 

AAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTTAGAAGTTAATAAAATGGAAGATTTTTTTGGAGAT 

ATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAGGAAAATTTAAATTTAGATGTGTATTTTGAGCAGG 

TGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGTTTATTATTGGTTTTGATCCTAAAATTAAAGCTGG . 

AACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAATTCTTATAACATGTATCTTGAAGATATTACAGGT 

GATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGATAAATCTGTTTTGTATGTTTTTCAAAAATCTGTTT 

TAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAGTTAATGGAACTGTTCTTATTAATAAGTATGCAAG 

ATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCCTATTTCTTTAGAAAAATATGAAAAAGTGGGGGAA 

GATTTAATAATTAGCAAGATTGAAAAATATGAATATTCT-AATGTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCG 

AAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAAAGAATTTAAGCAAAGATGAAGTTTATAAATTTTT 

GCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCATGTCAAAGATATTGATGAAGTTTTATTCTTGTCT 

TT.TGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAATTCTCAAGAAGTTGCAAAGATTGAATATATTTCAA 

AACCTGCTTACAATACTCTTAATGTTAGTGCAAAGTCTCTTTTTTCAGATTTGATAGTTTATAACTTTTGGATCAA 

AATTGTAGATAAAGAAAACATTGAAATCAAAATTGACACTAGCACAAATTCTTATGATAATAGTGGATTTTCGGGT 

ACATTTAAGAGGTTTGATGAGAATGTCTTAAATGTTAAAAAAGGGAGTAGTGATATTTATTTTATTCCTAGTGGAA 

ATTACGTGTATAAGGATAAAATTTATGATTTTTCTTACCCCCATTTAACTTATATTGATGAGAATAAAATTTATTA 

TGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCTTGAATATGAGATTGACATGGGTAGTTACAAGCTT 

GTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAJ^TTGTTCAAAAGCAAAAATTTTCTACAATCATTTTAAATCCTA 

TTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAAAATTAAAGCTTGAGCGAATAGAAAAAATATGA 

t929.nt . ' - - 

AAGGATAAAGAATTAATCCCATTTTATAAATTTTTGTTTTTATTCTTTTTTTTTACATTACTTGCTTGTTCCAAGG 
TAAGCAAAGATTTTATTGTTTTTAACAAAGATGTAAAGACTTCTTCCAGGATCGATAATCCAAATTCCAATGTTTT 
AGAAGTTAATAAAATGGAAGATTTTTTTGGAGATATTATAGATTTAAAAGGTTATAAAATTCTTTCAGTTCAGCAG 
GAAAATTTAAATTTAGATGTGTATTTTGAGCAGGTGGTTTTAGCTCAAAATTTTTCAAATCTTAATGCATATTTGT 

.TTATTATTGGTTTTGATCCTAAAATTAAAGCTGGAACGATTCTTTTTAAAACTCAAATAGATATTGATCCAAAAAA 
TTCTTATAACATG.TATCTTGAAGATATTACAGGTGATTATGATTTTAATATAGTTATTCAAGGATTTTTAAAAGAT 
AAATCTGTTTTGTATGTTTTTCAAAAATCTGTTTTAAATGATGTGTCTTCTTATAGGCCTATATTTTTTGACAAAG 
TTAATGGAACTGTTCTTATTAATAAGTATGCAAGATCTTCAGCTTATGAAGAAAACAGATCAAGAGAAAGCTATCC 
f ATTTCTTTAGAAAAATATGAAAAAGTGGGGGAAGATTTAATAATTAGCAAGATTGAAAAATATGAATATTCTAAT 
GTTCAGGGTAGATATTGTCTTTCTTCTGTGAGCGAAAAAGTTGGTAAAATTGATAATAATATTTATAAAACTTTAA 
AGAATTTAAGCAAAGATGAAGTTTATAAATTTTTGCATGGAGTTTGGTATGATGTTCATGACTATAATAAAATGCA 

*TGTCAAAGATATTGATGAAGTTTTATTCTTGTCTTTTGAAAGGCAATCAAGCGAGATTAATCTTTTCAGGAAAAAT 
TCTCAAGAAGTTGCAAAGATTGAATATATTTCAAAACCTGCTTACAATACTCTTAATGTTAG'TGCAAAGTCTCTTT 
TTTCAGATTTGATAGTTTATAACTTTTGGATCAAAATTGTAGATAAAGAA^ACATTGAAATCAAAATTGACACTAG 
CACAAATTCTTATGATAATAGTGGATTTTCGGGTACATTTAAGAGGTTTGATGAGAATGTCTTAAATGTTAAAAAA 
GGGAGTAGTGATATTTATTTTATTCCTAGTGGAAATTACGTGTATAAGGATAAAATTTATGATTTTTCTTACCCCC 

' ATTTAACTTATATTGATGAGAATAAAATTTATTATGGCATTTTTAATATTTTTCCTTTAAAAAATAATTTTGTTCT 
TGAATATGAGATTGACATGGGTAGTTACAAGCTTGTTGAATCTTTTTTCCTTGAGCATAGCGAAAGAATTGTTCAA 
AAGCAAAAATTTTCTACAATCATTTTAAATCCTATTAAAATTTTAAAAGATGATGTAAGCTTAGTTAAAGGGCAAA 

AAT.TAAAGCTTGAGCGAATAGAAAAAATATGA 



f933.aa 

MNKLLIFVLATFCVFSSFAQANDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLIL 

GFDAVGIGLILAGAYLDIKALIXSITKKAAFQWTWGKGVM^^ 

GGFEPSFDVAMGQSSALGFELSFKKSY 



t933 .aa 
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NDSKNGAFGMSAGEKLLVYETSKQDPIVPFLLNLFLGFGIGSFAQGDILGGSLILGFDAVGIGLILAGAYLDIKAL 

DGITKKAAFQWTWGKGVMLAGWTiyLAVTRL^ 

SFKKSY 



f 933 .nt 



ATGAATAAACTTTTAATTTTTGTTTTGGCAACCTTTTGTGTTTTTTCTAGCTTTGCTCAAGCTAATGATTCTAAAA 
ATGGTGCGTTTGGGATGAGT.GCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATCCTATTGTACCATT 
TTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGGTTCTCTTATTCTT 
GGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTTGATGGTATTACTA 
AAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGGCTGTGACAAGATT 
AACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCTTAATGTAGCTTTA 
GGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTGTCTTTCAAAAAAA 
GCTATTAA 



t933 .nt 



AATGATTCTAAAAATGGTGCGTTTGGGATGAGTGCTGGAGAAAAACTTTTGGTTTATGAAACTAGCAAGCAAGATC 
CTATTGTACCATTTTTATTGAACCTTTTTTTAGGGTTTGGAATAGGCTCCTTTGCTCAAGGAGATATTCTTGGAGG 
TTCTCTTATTCTTGGATTTGATGCGGTTGGTATAGGGCTTATACTTGCGGGGGCTTATTTGGATATCAAAGCGCTT 
GATGGTATTACTAAAAAAGCTGCTTTTCAATGGACTTGGGGTAAGGGAGTTATGTTAGCAGGTGTGGTTACTATGG 
CTGTGACAAGATTAACAGAAATTATTCTTCCATTTACATTTGCTAATAGTTATAATAGGAAGCTAAAAAATAGCCT 
TAATGTAGCTTTAGGAGGATTTGAACCTAGTTTTGATGTTGCAATGGGCCAATCCAGTGCTCTTGGGTTTGAACTG 
TCTTTCAAAAAAAGCTATTAA * " * 



f 940 .aa 



MRKYIFI ILIAVLLIGVNIKKIAAAANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQYHILMN 
LAIGLELRYMFNFDINHSFNILNPDSSVGKIFYSVPITF5INYI-FDIGELFQIPVFTNIGFSLNTYGDRNNNITNL . 
RTFDALPTISFGSGILWNFNYKWAFGATASWWyM^ 



t940 .aa 



ANIDRHTNSTLGIDLSVGIPIFYNDLSKAYPTNLYPGGIGAIKYQYHILNNLAIGLELRYMFNFDINHSFNILNPD 

SSVGKIFYSVPITFSIbTYIFDIGELFQIPVFTNIGFSLNTYGDRNNNIT^ 

GATASWWMMFEFGNSAKMAHFALVSLSVTVNVNKL 



£940. nt- 

ATGAGAAAGTATATTTTTATAATACTAATTGCAGTCTTGCTAATTGGTGTAAACATAAAAAAAATTGCGGCCGCAG 
C C AAT ATTGATAGGC ATAC AAA.C TC C AC TTTAGG AAT AGATTT AAGTGT AGGAATCCC TATTTTTTAC AACG ACTT 
ATC AAAAGC TT ATC CT AC C AATTT AT ATC C AGGAGGT ATTGGGGC AAT AAAATAC C AGT AC C AT ATTTTAAAC AAT 
TTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATTAAATCCAGATT 
CAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATAGGAGAATTATT 
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TCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTTA 
AGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTTG 
GAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCATT 
ATCAGTTACAGTGAATGTAAATAAATTGTAG 



t940.nt 

t 

GCCAATATTGATAGGCATACAAACTCCACTTTAGGAATAGATTTAAGTGTAGGAATCCCTATTTTTTACAACGACT 
TATCAAAAGCTTATCCTACCAATTTATATCCAGGAGGTATTGGGGCAATAAAATACCAGTACCATATTTTAAACAA 
TTTAGCAATTGGACTTGAACTAAGGTATATGTTTAACTTTGATATTAACCATTCTTTTAATATATTAAATCCAGAT 
TCAAGTGTAGGTAAAATTTTTTATAGCGTGCCTATTACATTTTCAATAAATTATATATTTGATATAGGAGAATTAT 
TTCAAATTCCAGTCTTCACAAATATAGGGTTTTCTCTTAATACATATGGAGATAGAAACAACAATATTACAAATTT 
AAGAACTTTTGATGCACTCCCTACAATCTCTTTTGGATCTGGAATTTTATGGAACTTTAACTATAAATGGGCTTTT 
GGAGCAACAGCATCTTGGTGGATGATGTTTGAATTTGGAAATTCTGCTAAAATGGCACATTTTGCACTTGTATCAT 
TATCAGTTACAGTGAATGTAAATAAATTGTAG 



f 943 .aa 



% MKNQFLNSYFQLITTIFLISSITIAAEEITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVT 

KNRKIYTIAKTLQKPIGIDYWDNKLYISSVDKIYWKNVKEEINKSIKSHKDYTWKMQIF 

KVDSKNNKLIVNIGSQHNVKIPPKKEAVILSINL 

PDEIOTITEYKEHFGFPWFGKNQKNYGFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIA^ 
GSWNRSSPVGYKITTLDIDSKTRTARNYKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



t943 .aa 



EITSTLKVPNGFKVEIFLNNTIEKPRGITSDQDGNIFIGSGSTFAYFVTKNRKIYTIAKTLQKPIGIDYWDNKLYI 
SSVDKIYWKNVKEEINKSIKSHKDYTWKM^ 

VILSINLKTKKEEIVAFGVRNSVGFDFHPISNEIYFSDNGQDGLGDNIPPDEINVITEYKEHFGFPYVFGKNQKNY 
GFYNKAPKNTKFIPSIYELPAHVAPLGIHFYRGNNFPKEYINKLFIAEHGSWNRSSPVGYKITTLDIDSKTRTARN 
YKTFLYGFLKHDKSKFGRPVDIITYYDGSILFTDDFGNKIYRVYYEKI 



f943 .nt 



ATGAAAAATCAATTTTTAAATAGCTATTTTCAATTAATTACAACTATTTTCTTAATCTCATCTATAACTATTGCAG 
CAGAAGAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAA 
ACCTAGAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACA 
• AAAAACAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCT 

ACATATCTTCTGTCGATAAAATATATGTAGTTAAAAATGT^ 

AGACTATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATT 
AAAGTAGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAG 

' AAGCAGTAATCCTTAGTATTAATTTAAAAACAAAJ^^GAAGAAATAGTAG 
GTTTGATTTTCACCCAATTAGCAATGAAATATATTTTAGCGACAATGGCCAAGACGGATTAGGAGACAACATTCCC 
CCAGATGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAAA 
ATTACGGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCGGCACATGTAGC 
TCCACTTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAATAAATTATTCATAGCAGAACAC 
GGCTCGTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCAA 
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GAAATT AC AAG AC TT TTTT AT ATGG ATTTTT AAAGC AC G AC AAATC T AAATT TGG AC GC C C TGTTG AT AT AATC AC 
ATATTATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



. t943 .nt 



GAAATAACAAGCACACTAAAAGTTCCTAATGGATTTAAAGTCGAAATTTTTTTAAACAATACAATTGAAAAACCTA 
GAGGAATCACAAGCGATCAAGATGGAAATATATTCATAGGATCTGGAAGCACTTTTGCATACTTTGTAACAAAAAA 
CAGAAAAATTTATACCATAGCAAAAACCCTGCAAAAACCTATTGGTATTGATTATTGGGATAATAAACTCTACATA 
TCTTCTGTCGATAAAATATATGTAGTTAAAAATGTAAAAGAAGAAATTAATAAAAGCATAAAATCACATAAAGACT 
ATACATGGAAAATGCAAATTTTTGCACTTTTGCCAAAAAATAATTCTCAAATGCACTCAGGACGTTACATTAAAGT 
AGATTCTAAAAATAACAAATTAATAGTAAATATAGGATCCCAGCACAATGTTAAAATTCCCCCAAAAAAAGAAGCA 
GTAATCCTTAGTATTAATTTAAAAACAAAAAAAGAAGAAATAGTAGCTTTTGGAGTGAGAAACTCAGTTGGGTTTG 
ATTTTCACCCAATTAGCAATGAAATATATTTTAGCGACAATGGCCAAGACGGATTAGGAGACAACATTCCCCCAGA 
TGAAATAAACGTAATAACCGAATATAAAGAACATTTTGGATTTCCCTATGTGTTTGGAAAAAATCAAAAAAATTAC 
GGTTTTTATAACAAAGCACCCAAAAACACTAAGTTTATCCCATCTATTTACGAACTTCCCGCACATGTAGCTCCAC 
TTGGAATACACTTTTACCGGGGAAATAACTTTCCAAAAGAATACATAAAT AAATT ATTCATAGCAGAACACGGCTC 
GTGGAACAGATCTTCTCCTGTTGGCTACAAAATAACAACACTAGACATTGATTCTAAAACCAGAACAGCAAGAAAT 
TACAAGACTTTTTTATATGGATTTTTAAAGCACGACAAATGTAAATTTGGACGCCCTGTTGATATAATCACATATT 
ATGACGGTTCAATTCTTTTTACAGATGACTTTGGAAATAAAATATACAGAGTTTACTACGAAAAGATTTAA 



f952 .aa 



iyn^ARFAVLIVLLFFYIWFFIILRMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGD 
. FESPIIVYGKSFNKSYEAKKVLKSMGFKNVFVAGTLKDMPQAKKEVG 



t952 .aa ^ 

RMKRTNLFLLEKIQNGAKILDIRSPKEYSKSHYLKSINIPFNNLFAKKDKLGDFESPIIVYGKSFNKSYEAKKVLK 
SMGFKNVFVAGTLKDMPQAKKEVG * ^. 



f952 .nt- 



ATGAATTATGCAAGATTTGCAGTATTAATAGTTCTGCTTTTTTTTTATATTTGGTTTTTTATTATCCTTAGGATGA 
AAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGATATTCGGTCTCCCAAAGAATA 
TAGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGAT AAATT AGGTGAT 
TTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAAAAGCATGG 
GATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCACAAGCAAAAAAAGAAGTTGGTTGA 



• t952 '.nt 



AGGATGAAAAGAACTAATCTGTTTTTGTTAGAAAAAATCCAAAATGGAGCAAAAATTTTGGATATTCGG 
AAGAJ^T AT AGCAAGTCTCATTATTTGAAGTCAATTAACATTCCTTTTAATAATTTATTTGCTAAAAAGGAT AAATT 
AGGTGATTTTGAGTCCCCAATAATTGTTTATGGTAAAAGTTTTAATAAGTCTTACGAGGCTAAAAAAGTTTTAAAA 
AGCATGGGATTTAAGAATGTGTTTGTTGCTGGAACCTTGAAAGACATGCCACAAGCAAAAAAAGAAGTTGGTTGA 
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f378.aa 



MIKKFLLFAMLNIFLTNKAHSNEEIISISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYI 
DESLIEGVNYDIAYAQMLLETGALKFNGIVSKEQHNFSGIGATNNLTKGNSFSNITEGIKAHIQHLKAYASKQNIK 
SNMVT5PRFYLVKRGSAPTIYDLTGKWAKDKLYDKKLKKILLELLEY1SINANKS 



.t378 .aa 



NEEIIEISTEIQKEKYIPFLISRGKTQLEDLVKYTLEINPELDKNYVNTVAKTYIDESLIEGVNYDIAYAQMLLET 

GALKFNGIVSKEQHNFSGIGATNNLTKGNSFSN^ 

LTGKWAKDKLYDKKLKKILLELLEYNNAJNKS- 



f378 .nt 



ATGATAAAAAAATTCTTGCTATTTGCAATGCTCAACATCTTTTTAACAAATAAAGCTCATAGTAATGAAGAGATAA 

TCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAA^ 

CCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATGTAAATACTGTTGCTAAAACCTATATA 

GACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACAGGAGCTCTAAAAT 

TCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTACAAAAGGAAATTC 

TTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAAACAAAATATCAAA 

TCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGATTTGACTGGGAAAT 

GGGC AAAAGAC AAAC TTT ACGAC AAAAAAC TT AAAAAAATATT ATTAGAAC T ATTAGAATAT AATAATGC AAAT AA 

AAGCTAA ' 



t378.nt 

AATGAAGAGATAATCGAAATAAGTACTGAAATACAAAAGGAAAAATATATTCCCTTTTTAATAAGTAGAGGAAAAA 
CTCAACTAGAAGACCTTGTAAAATATACTCTAGAAATAAATCCAGAGCTTGACAAAAACTATGTAAATACTGTTGC 
TAAAACCTATATAGACGAATCTTTGATTGAAGGGGTTAATTATGACATTGCCTATGCTCAAATGTTACTAGAAACA 
GGAGCTCTAAAATTCAATGGAATAGTTTCAAAAGAACAACACAATTTTTCAGGAATAGGCGCTACTAATAATCTTA 
CAAAAGGAAATTCTTTTTCCAATATTACAGAAGGAATTAAAGCTCATATTCAACATTTAAAAGCTTATGCTTCAAA 
ACAAAATATCAAATCAAATATGGTTGATCCTAGATTTTACCTTGTTAAAAGAGGATCTGCTCCAACAATATATGAT 
TTGAC TGGGAAATGGGC AAAAGAC AAAC TTT ACGAC AAAAAACTTAAAAAAAT ATT ATT AGAA.C TATTAGAAT AT A 
ATAATGCAAATAAAAGCTAA 



f 4 .aa 



MKLFRRJNTVMIKMPSSFTIIFSLIVFVTILTYVIPAG 

LTAMSKGMEHAVEVIVFVLIVGGAYGI IMKTGAIDVGIYFLIKKLGHKDKIjLIPLLMFIFSIGGTVTGMSEETLPF 
. YFVMIPLIVALGYDSLVGAAIIALGAGVGTMASTWPFATGIASAIASISLQDGFYFRIVLYFVSVLAAITWCVY 

ASKIKKDPSKSLVYSQKDEHYQYFVKKDGLSTGDNAQNALEFTFAHKLVLLLFGFMILILIFSIWLGWWMQEMTM 
^ LYLGVAIISAFICKLGETEMWDAFVKGSESLLTAALVIGLARGVMIVCDDGLITDTML 

" NEIIQIFIGFWPSSSGHASLTMPIMAPLADFLSIPRASWIAMQTASGLINLITPTSGVIMAVLGISRLSYGTW 
KFVLPLFMIEFFISILVIIANIYLSF 



t4 . aa 
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KFDKEFKQMGDGSKREIIVAGTYQYVDRGSRGFLHPIMTILTAMSKGMEHAVEVIVFVLIVGGAYGIIMKTGAIDV 
GIYFLIKKLGHKDKLLIPLLMFIFSIGGTVTGMSEETLPFYFVMIPLIVALGYDSLVGAAIIALGAGVGTMASTVN 
PFATGIASAIASISLQDGFYFRIVLYFVSVLAAITYVCVYASKIKKDPSKSLVYSQKDEHYQYFVKKDGLSTGDNA 
QNALEFTF AHKLVLLLFGFMI L I L I F S I VNLGWWMQEMTMLYLGVAX I S AF ICKLGETEMWDAFVKG S ES LLTAAL 
VIGLARGVMIVCDDGLITDTMLNA^ 

RASWIAMQTASGLINLITPTSGVIMAVLGISRLSYGTWFKFVLPLFMIEFFISILVIIANIYLSF 



f 4 .nt 



ATGAAATTATTTAGGAGAAACGTTATGATCAAAATGCCAAGTAGTTTTACAATAATATTTTCTTTAATTGTATTTG, 

TTACCATTTTAACGTATGTGATTCCTGGCGGTAAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAG 

GGAAATAATTGTTGCTGGAACTTATCAATATGTAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATT 

T-TAACCGCAATGTCAAAGGGGATGGAACATGCAGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATG 

GGATTATTATGAAAACTGGAGCAATAGATGTGGGAATTTATTTTTTAATCAAGAAGT-TGGGGCACAAAGATAAGTT 

GCTTATTCCTTTGTTAATGTTTATTTTTTCAATTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTT 

TATTTTGTTATGATTCCCTTGATAGTAGCTTTGGGTTATGATAGTCTTGTTGGAGC'GGCTATTATTGCTTTAGGAG 

CTGGAGTGGGAACTATGGCTTCTACTGTAAATCCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTT 

GCAGGATGGATTTTATTTTAGAATTGTTCTTTATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTAT 

GCGTCTAAAATTAAAAAGGATCCCTCAAAATCGCTTGTGTAT.TCTCAAAAAGATGAACATTATCAATATTTTGTTA 

AAAAAGATGGACTTTCTACCGGAGATAATGCTCAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACT 

TTTATTTGGATTTATGATATTGATTTTGATATTTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATG 

TTGTATCTTGGAGTTGCTATTATATCGGCTTTTATTTGTAAATTAGGTGAAAGTGAAATGTGGGATGCGTTTGTGA 

AAGGTTCTGAAAGTCTGCTAACCGCTGCTCTTGTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGG 

GTTGATTAGAGATACTATGTTAAATGCTGCTACTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCATATTG 

AATGAAATTATTCAAATATTTATAGGATTTGTTGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAA 

TGGCTCCTCTTGCCGATTTTTTGTCAATTCCAAGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTAT 

TAATTTGATAACACCTACCAGCGGAGTTATAATGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTT 

AAGTTTGTTTTACCATTATTTATGATTGAGTTTTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTT 

TTTAG 



t4 .nt 



: AAGTTTGATAAAGAATTTAAGCAAATGGGTGATGGATCTAAAAGGGAAATAATTGTTGCTGGAACT.TATCAATATG 
■TAGATCGAGGCTCTAGGGGATTTTTACATCCTATTATGACTATTTTAACCGCAATGTCAAAGGGGATGGAACATGC 
AGTTGAAGTTATTGTTTTTGTTTTAATTGTTGGGGGTGCTTATGGGATTATTATGAAAACTGGAGCAATAGATGTG 
GGAATTTATTTTTTAATCAAGAAGTTGGGGCACAAAGATAAGTTGCTTATTCCTTTGTTAATGTTTATTTTTTCAA 
TTGGTGGAACTGTAACCGGAATGAGTGAAGAGACCCTTCCTTTTTATTTTGTTATGATTCCCTTGATAGTAGCTTT 
GGGTTATGATAGTCTTGTTGGAGCGGCTATTATTGCTTTAGGAGCTGGAGTGGGAACTATGGCTTCTACTGTAAAT 
CCATTTGCGACAGGAATTGCATCTGCAATAGCTTCTATTAGCTTGCAGGATGGATTTTATTTTAGAATTGTTCTTT 
ATTTTGTATCAGTATTGGCTGCTATAACCTATGTTTGTGTTTATGCGTCTAAAATTAAAAAGGATCCCTCAAAATC 
GCTTGTGTATTCTCAAAAAGATGAACATTATCAATATTTTGTTAAAAAAGATGGACTTTCTACCGGAGATAATGCT 
CAGAATGCTCTTGAGTTTACTTTTGCTCATAAATTAGTTTTACTTTTATTTGGATTTATGATATTGATTTTGATAT 
TTAGCATTGTTAATCTTGGTTGGTGGATGCAAGAAATGACAATGTTGTATCTTGGAGTTGCTATTATATCGGCTTT 
TATTTGTAAATTAGGTGAAACTGAAATGTGGGATGCGTTTGTGAAAGGTTCTGAAAGTCTGCTAACCGCTGCTCTT 
GTTATTGGACTTGCTAGAGGTGTTATGATAGTATGTGATGATGGGTTGATTACAGATACTATGTTAAATGCTGCTA 

• CTAATTTTTTATACAATCTTCCAAGACCCCTTTTTATCATATTGAATGAAATTATTCAAATATTTATAGGATTTGT 
TGTTCCATCTTCATCAGGACATGCTAGTCTCACTATGCCAATAATGGCTCCTCTTGCCGATTTTTTGTCAATTCCA 
AGAGCTTCAGTTGTTATTGCCATGCAGACTGCATCTGGGCTTATTAATTTGATAACACCTACCAGCGGAGTTATAA 
TGGCTGTATTGGGGATATCCAGATTGAGTTATGGTACGTGGTTTAAGTTTGTTTTACCATTATTTATGATTGAGTT 

TTTTATCTCTATTTTAGTTATTATAGCTAACATTTATTTAAGTTTTTAG 



78 



TABLE 1. Nucleotide and Amino Acid Sequences 



f 43 .aa 



MKYFYFLFFLLIFNVYAQINrVNSPALPSPPLLPEIT 

IKALKKSSDSQYNFSLKKRLEKTFNAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFD 

DKEKLKKTLIDILENKEGNWSIAAYYLGELN^ 

SLDNYEGPSIKAAAIEALSYLASDKOTENADLYLQSNN^ 

KAINAIKGHRDSSAKDILIYKLKSDPSLKVREASAKALIDMDLGNIEIKNIMFDFKIDNNFKISMFSYLLDKDSLK 
ALSIALEIVNKENINRPSNVLRGVASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSE 
TNSEYIKMLLKDY 



t43 .aa 



LPSPPLLPEITENKPVERENSSKGENFSNVGLDGKYVNDTILYGLDSQVTSIIKALKKSSDSQYNFSLKKRLEKTF 

NAELKREILELFISLKYSGGIDTANYILENYESKRYSNALFGLAISYLKEFDDKEKLKKTLIDILENKEGNWSIA 

AYYLGELNSLEYSKNMMEVFEKYSGNDGARREILIALGK^ 

KVTENADLYLQSNNNNL]>TV1CLAI^ 

DPSLKWEASAKALIDMDLGNIEIKNIMFDFKIDNN^ 

ASMLAGKKGNFDNFYSKIIDSKNIDLRHLALKGAVYNKSSSLSDKLKKIKSETNSEYIKMLLKDY 



f 43 .nt 



ATGAAATACTTTTATTTTTTATTTTTTTTACTTATTTTTAATGTGTATGCTCAAAATGTTAATTCTCCAGCTCTTC 
CTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGAATTT 
TTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGGGCTTGATAGTCAAGTGACAAGCATT 
ATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTTAATG 
CTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTGATACAGCAAATTATAT 
TCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATCTTAAGGAGTTTGAT 
GATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCAGCTT 
ATTATTTAGGAGAGCTTAATTCTCT TGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAAATGA 
TGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGAAATT 
TCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGATAAAG 
TAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTTCTTT 
GTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTGTGATGATAATATTAGGTTT 
AAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGCGATC 
CATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAAACAT 
TATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCTAAAA 
GCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGCGTTGCTT 
CAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATTTAAG 
GCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAGTGAA 
ACGAACTCCGAATATATTAAAATGCTTTTAAAAGATTATTGA 



t43 .nt 



CTTCCTAGTCCGCCTTTGTTGCCCGAAATTACAGAAAATAAGCCTGTTGAGAGAGAAAATTCTTCTAAGGGAGAGA 
ATTTTTCTAATGTTGGTTTAGATGGTAAGTATGTTAACGATACAATTCTTTATGGGCTTGATAGTCAAGTGACAAG 
CATTATAAAAGCTCTTAAAAAATCAAGCGATAGTCAATATAATTTTTCTCTTAAAAAAAGACTTGAGAAAACTTTT 
AATGCTGAGCTTAAAAGGGAAATACTTGAATTGTTTATTTCTCTTAAGTATTCGGGGGGCATTGATACA.GCAAATT 
ATATTCTTGAAAATTATGAGAGTAAAAGATATTCAAACGCTTTATTTGGCTTGGCAATTTCGTATCTTAAGGAGTT 
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TGATGATAAAGAAAAATTAAAAAAAACTCTTATTGACATTCTTGAAAATAAAGAGGGCAATGTGGTATCTATTGCA 
GCTTATTATTTAGGAGAGCTTAATTCTCTTGAGTATTCTAAAAACATGATGGAAGTTTTTGAAAAATATTCTGGAA 
ATGATGGGGCTAGAAGAGAAATACTTATTGCTCTTGGAAAAATGTCCGCTGTTGATTATCAGGATAGAATTTATGA 
AATTTCGCTAGATAATTACGAGGGCCCATCAATTAAGGCTGCTGCAATCGAAGCGTTGTCATATCTTGCTTCAGAT 
.AAAGTAACTGAAAATGCTGATTTGTATCTTCAGAGTAATAACAATAATTTAAATGTTAAATTAGCTATTATTGCTT 
CTTTGTCCAAAGATCCTTCTTTAAAGTCTAAAGAGATTTTACAAGGATTTTTAAGAGATTCTGATGATAATATTAG 
GTTTAAAGCTATTAATGCAATCAAAGGACATAGGGACTCTTCTGCAAAGGATATTTTGATTTATAAGCTTAAAAGC 
GATCCATCTCTTAAAGTTAGGGAGGCTTCTGCTAAGGCCTTAATTGATATGGATCTTGGGAATATTGAGATAAAAA 
ACATTATGTTTGATTTTAAGATTGACAATAATTTTAAAATTTCAATGTTTAGTTACCTTTTAGATAAGGATTCTCT 
AAAAGCATTGTCAATTGCTTTAGAAATTGTTAATAAAGAAAATATTAATAGACCCTCAAATGTTTTAAGGGGC 
GCTTCAATGTTGGCTGGTAAAAAGGGTAATTTTGATAATTTTTATTCTAAAATCATTGACAGCAAAAATATTGATT 
TAAGGCATTTAGCATTAAAAGGAGCTGTTTATAATAAATCTTCATCGCTTTCTGATAAGCTTAAAAAAATTAAAAG 
TGAAACGAAC TC CGAAT AT ATTAAAATGCTTTTAAAAG ATT ATTG A ' 



f 50 .aa 



MKFVLI^LFKGCLiCFFLFFSCLTTDRSIQDSHISDIVEKKKEAVIIDDNNVVLGSNEGKFKRDYLIGLKDNESFF 
LSDAFLKENNFYFKKARESYAKKNIGLTISTYTLNKIVTNENQHSRELLAKANL 

DYKYSHASLRLAELKYLVKEKSDAISAFKEINEFSI-SGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFND 

NIFVTNILGGLLRYNIKKNDCRVYLKDKKSIFLNGIRGFADYNGTIYIGGKNVVYYIDDV^ 

WQVLLAVKNGIFVGTLNSGLWFYDLKNWKN^ 

FSKNDNEKNINFIKEYKDSYFVGTYGGGLFELNLNI^ 

SENDNWDYFGPNNGLLNLNLIKVSRFEISTYVILGTINNGLVFVDENIKKQL 



tSO.aa 

CLTTDRSIQDSHISDIVEKKKEAVIIDDNIWV^^ 

KKNIGLTNYYLNKIVTNENQHSRELL^KANLFFGYVNYENGFYDLSEYNFDLFLKDYKYSHA^ 

SDAISAFKEINEFSISGYDREIYGFLSNKLGVSHLNLESLGFLDNSVFDTFVFNDNIFVTNILGGLLRYNIKKNDC 
RVYLKDKKS I FLNGIRGFADYNGTI YIGGKNWYYIDDVDGD FVGTLNSGL 
WFYDLKNWKNIPLGSNKISSLCFDSLKjNLLLVGTVDKAIYSVNTONLKKIEH 
VGTYGGGLFELNLNKNSYKKPIVIAISnsriDVNYFMDMEIKDKKLLFATFDHGLLIYDSE^ 
KVSRFENYVILGTINNGLVFVDENIKKQL 



f50 .nt 



ATGAAATTTGTTTTGAATAATTTATTTAAAGGTTGTCTTATATGTTTTTTCTTGTTTTTTTCCTGCCTTACTACAG 
ATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGAAGCAGTCATTATTGATGATAATAA 
TGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAGAGACTATTTGATAGGATTAAAAGATAATGAATCTTTTTTT 
CTTAGTGATGCTTTTTTAAAAGAAAATAATTTTTATTTTAAAAAAGCCAGGGAAAGTTATGCTAAAAAAAATATTG 
GCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATCAGCACAGCAGAGAATTGCTAGCTAAAGCGAA 
TTTGTTTTTTGGATATGTAAATTATGAGAATGGTTTTTATGATCTTTCCGAATATAATTTTGATCTATTTTTAAAA 
GACTATAAATATTCTCATGCTAGTTTAAGATTAGCTGAATTAAAATATCTTGTTAAAGAAAAATCTGATGCAATTT 
CTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGGTTATGATAGAGAGATTTATGGCTTTTTAAGTAATAAACT 
TGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAACAGCGTTTTTGATACATTTGTCTTTAATGAC 
AATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAATGATTGTAGAGTCTATCTTA 
AGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGGTTTTGCGGATTATAATGGAACAATTTATATTGGTGGTAA 
AAATGTTGTTTATTATATAGATGATGTTGATGGGGATTTAAAGCAAATAAATGTTCCCGGTAATGCTGATTTTAGC 
AATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTGTTGGGACTCTAAATTCTGGATTATGGTTTTATGATT 
TAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTGCTTTGATAGTTTAAAAAATTTATT 
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ATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGAACATTTGGATTTT 
TTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTTGTTGGAACATATG 
GTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATAATATTGATGTTAA 
TTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTTATTGATTTATGAT 
TCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATAAAAGTTTCTAGAT 
TTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTGTAGATGAAAATATTAAAAAACAGTTATG 

A 



t50 . nt 



TGCCTTACTACAGATAGATCTATTCAAGATTCTCATATTAGTGATATTGTAGAGAAGAAAAAAGAAGCAGTCATTA 

TTGATGATAATAATGTTGTTCTTGGGAGTAATGAGGGTAAATTTAAAAGAGACTATTTGATAGGATTAAAAGATAA 

TGAATCTTTTTTTCTTAGTGATGCTTTTTTAAAAGAAAATAATTTTTATTTTAAAAAAGCCAGGGAAAGTTATGCT 

AAAAAAAATATTGGCTTGACAAATTATTATTTGAATAAAATAGTAACTAATGAGAATCAGCACAGCAGAGAATTGC 

TAGCTAAAGCGAATTTGTTTTTTGGATATGTAAATTATGAGAATGGTTTTTATGATCTTTCCGAATATAATTTTGA 

TCTATTTTTAAAAGACTATAAATATTCTCATGCTAGTTTAAGATTAGCTGAATTAAAATATCTTGTT^AAGAAAAA 

TCTGATGCAATTTCTGCATTTAAAGAGATTAATGAATTTTCTATCTCAGGTTATGATAGAGAGATTTATGGCTTTT 

TAAGTAATAAACTTGGAGTAAGTCATTTAAACTTAGAGTCTTTAGGATTTCTTGACAACAGCGTTTTTGATACATT 

TGTCTTTAATGACAATATATTTGTAACTAATATATTGGGAGGGCTTTTAAGATATAATATTAAAAAAAATGATTGT 

AGAGTCTATCTTAAGGATAAAAAAAGCATTTTTTTAAATGGCATTAGGGGTTTTGCGGATTATAATGGAACAATTT 

ATATTGGTGGTAAAAATGTTGTTTATTATATAGATGATGTTGATGGGGATTTAAAGCAAATAAATGTTCCCGGTAA 

TGCTGATTTTAGCAATGTACAAGTTTTGCTTGCTGTTAAAAATGGAATATTTGTTGGCACTCTAAATTCTGGATTA 

TGGTTTTATGATTTAAAAAATTGGAAAAATATACCGCTTGGATCTAATAAAATTTCTTCACTCTGCTTTGATAGTT 

TAAAAAATTTATTATTAGTTGGAACAGTTGACAAGGCTATTTATAGTGTTAATGTCGATAATTTGAAAAAGATTGA- 

ACATTTGGATTTTTTTAGCAAAAATGATAATGAAAAAAATATTAATTTTATAAAAGAATATAAAGATAGTTATTTT 

GTTGGAACATATGGTGGGGGTCTTTTTGAATTAAATTTAAATAAAAATAGTTACAAAAAGCACGTTATTGCCAATA 

ATATTGATGTTAATTATTTTATGGATATGGAGATTAAAGATAAAAAGCTATTGTTTGCAACCTTTGATCATGGGTT 

ATTGATTTATGATTCTGAAAATGACAACTGGGATTATTTTGGACCCAATAATGGGCTTCTTAATTTGAATTTAATA 

AAAGTTTCTAGATTTGAAAATTATGTCATACTGGGCACTATTAATAACGGTTTGGTTTTTGTAGATGAAAATATTA 

AAAAACAGTTATGA 



f65.aa " • ' 

MHIFKNVPFQINLILFLLVSVAKINASSKFY^ ." 
ETKEQWEKYKLLFK31HWLLLVRQNLHLGDLFDT 

ENYTTVKLENDGITNWEDEYHKISLKELNYYDIIKKELLRIDETKAFFEQGPNYY • 

t65 .aa ' ■ ' 

KINASSKFYYAEQWWIFNSQMKKKPENYKKNIFFLQKALKYPFGNPK^ 

RQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSKKLYKLAIN^^ 

I S LKELNYYD 1 1 KKE LLRI DETKA FFEQG PNYY 

f65.nt 

ATGCATATTTTCAAAAATGTCCCCTTCCAAATAAATTTAATTTTATTTCTTTTAGTATCAGTTGCAAAGATAAATG 
CATCGTCCAAATTTTATTACGCAGAACAATGGTATGTAATTTTTAATTCTCAAATGAAAAAAAAACCTGAAAACTA 
TAAAAAAAATATATTTTTTCTTCAAAAAGCCTTAA 

GAAACCAAAGAACAGTGGGAAAAATATAAACTTCTTTTCAAAATGCATGTAAJ^CTTGCTTCTAGTTAGGCA^ 

TACATTTAGGAGATTTATTCGACACAAGAAATTTATATTTTTTCAAAACTCCAGAAAAAGATGGAATTATTTCCAA 

TCTAGAAAAATCAAAAAAATTATATAAACTAGCTATTAATTACTACAGCGAAGCACTAAAATACCACAAAAAACTT 

GAAAATTACACAACTGTTAAACTAGAAAACGATGG 
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AAGAGCTTAATTACTATGACATTATTAAAAAAGAACTACTAAGAATTGACGAAACTAAAGCATTTTTTGAACAAGG 
GCCAAACTATTATTAA 

t65.nt 

r- 

KINASSKFYYAEQWYVIFNSQMKKKPENYKKNIFFLQKALKYPFGNPKYSLTKIETKEQWEKYKLLFKMHVNLLLV 

RQNLHLGDLFDTRNLYFFKTPEKDGIISNLEKSK^^ 
ISLKELNYYDIIKKSLLRIDETKAFFEQGPNYY 



f8.aa 

MKNINRLILLILTTHTLLFSCAL^ 

LIKKTAAKIKISPQKLEEIKNYLNAYKNYLNNETEWIKFIDQSSVNGNLTIKIDTAFEKKTNFNHTNSDNENLTEL 
IELQMHLEKEILNLIEQTFHDKNLGYIQLSHINSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQF 
MHFLKVENSKIKTIIEKQKISDLHNELYYSKQSPPRRRKRSTADSDNNNKYDIIPKIIDPNTGIEITPKNLRSILS 
NGDIILIKPKIDWTEFFYFWQHVGIFDEEKYEATKKIAFNGipSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQ. 
SRILKIFSPITDIRTIQKAINFGRSRYIDNNFGYMVPLISSNLWTDSFNLEEIHNKTYCSLMVDRIYKIAGLNVSR 
NYEISGIITPGEINAAAYNFYMSYTIAGILPSVLPKRLIKPTLKEKFIGYWKEIVDAIELKKSKEKIFGRACNITN 

LWCSGS 
t8 .aa 

CALIADNKSKNLSTSfellLTQKTLLESSLIK^ 
NYLNAYKNYLNNETEWIKFIDQSSVNGN^ 

DKNLGYIQLSHiNSFFPQENINSITKEIIDGKEYIAPHIIANQLLKIKDKKYFEQFMHFLKVENSKIKTIIEKQKI 
SDLHNELYYSKQSPPRRRKRSTADSDNNNKYDIIPKIIDPN-TGIEITPKNLRSILSNGDIILIKPKIDWTEFFYFW 
QHVGIFDEEKYEATKKIAFNGIDSFDIKSIITSNQIKFDTASTQGSGYEKLSTYVQSRILKIFSPITDIRTIQKAI 

nfgrsryidnnfgymvplissnlwtdsfnleeihnktycslmvdriykiaglnvsrnyeisgiitpgeinaaa™ 

YMSYTIAGILPSVLPKRLIKPTLKEKFIGYNKEIVDAIELKKSKEKIFGRACNITNLWCSGS 
f8.nt 



ATGAAGAATATTAATAGATTAATATTATTAATATTAACTACACACACTTTATTATTCTCTTGTGCCTTAATTGCAG 

ATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAGAAAGCTCTTTAATA^ 

AAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAATAACAATGATTCTTTT 

TTAATAAAAAAAACAGCAGCAAAAATCAAAATAAGCCCTCAAAAACTTGAAGAAATAAAAAACTAT 

ATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCAATGGAAATTTAACAAT 

TAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTTAATCATACAAATTCAGATAATGAAAATTTAACAGAACTA 

ATAGAACTACAAATGCATCTGGAAAAAGAAATTTTAAACTTi^TTGAGCAAACATTTCATGATAAAAATTTAGGAT 

ATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAAAAGAAATAATAGATGG 

AAAAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAAAAATATTTTGAACAATTT 

ATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAA^ 

AACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCGATAATAACAATAAATA 

CGATATAATACCAAAAATAATAGACCCAAATACAGGCATTGAAATAACTCCTAAAAATTTAAGATCTATTTTATCA 

AATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGGCAACATGTGGGAATAT 

TTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAATGGAATTGATAGCTTTGATATAAAATCAATAAT 

TACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAA.GGTTCAGGATACGAAAAGCTTTCAACATACGTACAA 

TCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATTAATTTT 

GATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAGATTCATTCAATCTTGA 

AGAAATTC AC AAC P> AAACC TATTGCTC TTT AATGGTTGAT AGAAT AT AT AAAAT AGC AGGAOTT AATGTATC AAG A 



TABLE 1. Nucleotide and Amino Acid Sequences 



AATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTTTACATGTCTTATACGA 
TTGCAGGAATACTTCCAAGCGTGCTTGCAAAAAGGCTCATTAAACCAACATTAAAAGAAAAATTCATTGGTTACAA 
TAJ^AGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGCTTGCAACATTACAAAT 

CTCTGGTGCTCAGGAAGTTAA 



t8 .nt 



TGTGCCTTAATTGCAGATAATAAGTCAAAAAATTTAAGCACATCAGAAATCATATTAACACAAAAAACACTACTAG 
AAAGCTCTTTAATAAAAAATCCTTCTAATGTAGAATATCGAATACCAATATCCAGTATCCAAGAAATTTTAAACAA 
TAACAATGATTCTTTTTTAATAAAAAAAACAGCAGCAAAAATCA 

AACTATCTAAATGCTTATAAAAATTATCTAAATAATGAAACAGAATGGATAAAGTTTATAGATCAAAGTAGCGTCA 
ATGGAAATTTAACAATTAAAATTGATACTGCTTTTGAAAAAAAAACAAATTTTAATCATACAAATTCAGATAATGA 
AAATTTAACAGAACTAATAGAAGTACAAATGCATCTGGAAAAAGAAATTTTAAACTTAATTGAGCAAACATTTCAT 
GATAAAAATTTAGGATATATACAATTAAGTCACATCAACTCATTCTTTCCTCAAGAAAATATAAACTCAATAACAA 
AAGAAATAATAGATGGAAAAGAATATATTGCACCGCACATAATAGCAAATCAATTATTAAAAATAAAAGATAAA^ 
ATATTTTGAACAATTTATGCACTTTTTAAAAGTTGAAAACAGCAAAATAAAAACAATAATTGAAAAACAAAAAATT 
TCAGATCTTCACAATGAACTGTATTATTCAAAACAATCCCCGCCCAGAAGAAGAAAAAGGTCAACTGCCGATTCCG 

AT AAT AAC AATAAAT AC GATAT AAT AC C AAAAATAATAGAC C C AAAT AC AGGC ATTGAAAT AACTC C T AAAAATTT 
AAGATCTATTTTATCAAATGGCGACATAATACTAATAAAACCAAAAATAGATTGGACAGAATTTTTTTATTTTTGG 
"CAACATGTGGGAATATTTGATGAAGAAAAATATGAAGCCACTAAAAAAATTGCATTCAATGGAATTGATAGCTTTG 
ATATAAAATCAATAATTACAAGCAATCAAATCAAATTCGATACAGCATCTACTCAAGGTTCAGGATACGAAAAGCT 
TTCAACATACGTACAATCAAGAATATTAAAAATATTCTCACCAATAACAGACATAAGAACAATTCAAAAAGCTATT 
AATTTTGGAAGAAGTAGATACATTGACAATAACTTTGGATATATGGTTCCATTAATATCCTCTAATTTATGGACAG 
ATTCATTCAATCTTGAAGAAATTCACAACAAAACCTATTGCTCTTTAATGGTTGATAGAATATATAAAATAGCAGG 
ACTTAATGTATCAAGAAATTACGAAATTTCGGGAATAATTACTCCTGGAGAAATAAATGCAGCAGCTTACAATTTT 
TACATGTCTTATACGATTGCAGGAATACTTCCAAGCGTGCTTCCAAAAAGGCTCATTAAACCAACATTAAAAGAAA 
AATTCATTGGTTACAATAAAGAAATAGTAGATGCAATAGAATTAAAAAAATCGAAAGAAAAAATTTTTGGGAGAGC 

TTGCAACATTACAAATCTCTGGTGCTCAGGAAGTTAA 



f 82 . aa 



MTRVFSKFFLFFCFSMLLFANSEDSNEKDIVSKDENPVFENEVLGYWVGYNDVSNIKNSI-IYIYKYNGEVYGRILT 
IIKDGKKYDAKNPSGDTVVGFENLAIEGLDFMWGLKYSSSSKKWDRGKI IDPKNGKIYNSEMRVDSKTGNLTTKGK 
VWIFGRSKIWTRAKDDEIPKLDLHNLVPAPPVKK 



t82."aa 

EDSNEKDIVSKDENPVFENEVLGYWVGYNDVSNIKNSIIYIYKYNGEVYGRILTIIKTC 

NLA I EGLDFMWGLKYS S SSKKWDRGKI IDPKNGKI YNS EMRVDSKTGNL XTKGKVWI FGR.S KIWTRAKDDEI PKLD 
LHNL V PA? P VKK ' 



f82nt 

ATGACTAGAGTTTTTTCAAAGTTTTTTCTTTTTTTTTGTTTTTCAATGCTTTTATTTGCAAATTCAGAAGATTCAA 
ATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGGTTGGTTATAA 
TGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCGAATTTTAACT 

AT AAT AAAAGATGGC AAAAAGT ATGATGC T AAAAATC CTTC AGGAG ATAC TGT AGTTGGGTTTGAAAATC TTGC AA 
TAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTGGGATAGGGGCAAAATAATAGA, 
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TCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTACCAAGGGGAAA 
GTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGATTTGCATAATC 
TTGTTC C AGC GC C CC C TGTG AAAAAAT AA 

f82.nt, 

GAAGATTCAAATGAAAAGGACATTGTTAGCAAGGATGAAAACCCTGTTTTTGAAAATGAAGTTTTAGGATATTGGG 

TTGGTTATAATGATGTAAGTAACATAAAGAATTCTATTATCTATATTTATAAATATAATGGGGAAGTTTATGGCCG ' 

AATTTTAACTATAATAAAAGATGGCAAAAAGTATGATGCTAAAAATCCTTCAGGAGATACTGTAGTTGGGTTTGAA 

AATCTTGCAATAGAGGGTCTTGATTTTATGTGGGGTCTTAAGTATTCTTCTTCTTCTAAAAAGTGGGATAGGGGGA 

AAATAATAGATCCTAAAAACGGTAAAATTTATAATTCTGAGATGCGTGTTGATAGTAAAACCGGAAATCTTATTAC 

CAAGGGGAAAGTTTGGATTTTTGGTAGAAGTAAAATTTGGACAAGAGCTAAAGATGATGAAATACCAAAATTAGAT 

TTGCATAATCTTGTTCCAGCGCCCCCTGTGAAAAAATAA 



f 36". aa 



MNKLMLMLITFATSLLAQTNKA 

LIIKILEENNI INEIGSNNIESKNIALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEIIA 
IASGITQPNNKLKGSGYTIDSVIINENQNINHSYNIILKKGKYTLINRIHKILTSKKINNKIKSDSTIEIEAKNIS 
LLEEIENIKIETNPKILIDKKNGIILASENAKIGTFTFSIEKDNQNIFLSKNNKTTIQVNSMKLNEFILKNSNNLS 
NKELIQIIQAAQKINKLNGELILEEIDGNQN 



t86 .aa 



LKTDQSFNNSLSESVKLKEIADIYPTNTNFLTGIGIVAGLAGKGDSIKQKDLIIKILEENNIINEIGSNNIESKNI 
ALVNVSLQVKGNTIKGSKHKACVASILDSKDLTNGILLKTNLKNKEGEXIAIASGITQPNNKLKGSGYTIDSVIIN" 
ENQNINHSYNIILKKGNYTLINRIHKILTSKKIISnMKIKSDSTIEIEAKNISLLEEIENIKIETNPKILIDKKNGII 
LASENAKIGTFTFSIEKDNQNIFLSKNNKTTIQVNSMKLNEFILKNSNNLSNKELIQIIQAAQKINKLiN 

IDGNQN 
f86.~nt 



ATGAACAAACTAATGTTGATGTTAATTACATTTGCAACGAGTCTATTAGCCCAAACAAACAAAPCTTCAACAGGAC 
TAAAAACAGATCAATCATTTAACAATAGCCTATCTdAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCAC 
AAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGAC 
CTTATAATTAAAATTTTAGAAGAAAACAATATAATAAATGAAATAGGCTCTAATAAC^ 

CACTAGTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAACATAAAGCTTGCGTTGCATCAAT 
ACTGGACTCAAAAGATTTAACAAATGGAATACTTTTAAAAACAAATCTTAAAAATAAAGAGGGGGAAATAATAGCA 

ATTGC ATC AGGAATTACAC AGC C C AATAAT AAATT AAAAGGATC TGG AT AT AC TATAGAT AGTGT AAT AAT AAATG 
AGAATCAAAATATTAACCACAGTTATAATATAATTCTTAAAAAAGGAAATTATACATTAATAAATAGAATTCATAA 
AAT ATT AACCTCTAAAAAAATC AAC AAC AAAATT AAATC AGAC AGC AC AAT AGAAAT AGAAGC AAAAAAC AT AAGC 
CT ATT AGAAGAGATTGAAAAT ATTAAAATAGAAAC C AAC C C C AAGAT ATT AAT AGAC AAAAAAAATGGT ATT ATTT 
TAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAJVAGACAATCAAAACATTTTTTTAAGT 
AAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATTTATATTAAAAAATTCCAJVCAATCTTAGC 
AAT AAAGAATTAATTCAAATAATTC AAGC TGCGCAAAAAATT AAT AAATT 
TTGATGGAAAC C AAAATT AA 



t86 .nt 
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CTAAAAACAGATCAATCATTTAACAATAGCCTATCTGAAAGCGTAAAATTAAAAGAAATTGCGGATATTTATCCCA 
CAAATACAAATTTTTTAACAGGTATTGGAATAGTAGCGGGACTTGCTGGAAAAGGAGACTCTATAAAACAAAAAGA 
CC TT AT AATT AAAATTTT AGAAGAAAAC AATAT AATAAATGAAAT AGGC TC T AATAAC ATA.GAAAGTAAAAATATT 
GCACTA.GTAAATGTCAGTCTCCAAGTAAAAGGTAATACAATCAAAGGTTCAAAAGATAAAGCTTGCGTTGCATCAA 
TACTGGACTCAAAAGATTTAACAAATGGAATACTTTTAAAAACAAATCT^ 

AATTGC A/TC AGG AATT AC AC AGC C C AAT AAT AAATTAAAAGGATC TGG AT AT AC T ATAGAT AGTGT AATAATAAAT 
GAGAATCAAAATATTAACCACAGTTATAATATAA-TTCTTAAAAAAGGAAATTATACATTAATAAATAGAATTCATA 
AAATATTAACCTCTAAAAAAATCAACAACAAAATTAAATCAGACAGCACAJ\^ 

CCTA.TTAGAAGAGATTGAAAATATTAAAATAGAAACCAACCCCAAGATATTAATAGACAAAAAAAATGGTATTATT 

TTAGCAAGTGAAAATGCAAAAATAGGAACTTTTACATTTTCCATTGAAAAAGACAATCAAAACATTTTTTTAAGTA 

AAAATAACAAAACAACAATTCAAGTAAACTCAATGAAATTAAATGAATTTATATTAAAAAATTCCA^ 

CAATAAAGAATTAATTCAAATAATTCAAGCTGCGCAAAAAATTAATAAJV^ 

AT TG ATGG AAAC C AAAAT T AA 



f90.aa 

MCPITFTIPFFLAIFFAFSSSFVTDSSVSLLSRNTSIjFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFI 
•KSLFEVIKLPIWLFI IFASGYFLNAFSIFLCISSFLSFMFI . 

t90.aa 

SSFVTDSSVSLLSRNTSLFSTLTPISLPIISGTLPAIVTLSKKYLSISLSFSKMIFIKSLFEVIKLPIWLFIIFAS 
GYFLNAFSIFLCISSFLSFMFI 

f90.nt 



ATGTGTCCTATTACTTTTACCATTCCATTTTTTCTAGCAATATTTTTTGCTTTTTCAAGCTCCTTTGTTACGGACT 
CTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTACTTTAACTCCAATTTCTTTGCCTATTATTTCTGG 
TACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTTTTCTAAAATGATTTTCATC 
AAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCAGGATACTTTTTAAATGCTT 
TTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



t90 .nt 



AGCTCCTTTGTTACGGACTCTTCTGTGTCTTTGCTATCAAGAAATACGTCTCTTTTTTCTA.CTTTAACTCCAATTT 
CTTTGCCTATTATTTCTGGTACGCTTCCTGCAATAGTTACGCTGTCGAAAAAATATCTGTCAATCTCTTTAAGCTT 
TTCTAAAATGATTTTCATCAAATCTTTATTTGAAGTGATTAAACTTCCCATATGGTTATTCATTATTTTTGCATCA 
GGATACTTTTTAAATGCTTTTTCGATTTTTTTGTGTATTTCTTCTTTTTTATCTTTTATGTTTATATGA 



f469.aa 

MAiSH/ALSSGFISQKIFGIII IMVFLPTIIATPIINFLFKXNKSGLKKELPIDQNTHICVSFEYDNLAKILIWDFKN 

ELRKEGFFTQQIIO^SSQYINARKNNISFSIKREGSKITFECPNNHLIIIQDLFF-ETILNLEKITKEVETVSLFJ^ 

KLDYSINYDKILSNINLNKP.IKKENXILELKSSNKAX)VIRELLSVINIEIDKERIFQDLMEREKLITTALKEGFAI 

PHLKTNLISKIHIAIGISHEGIDFNAJ^DKNL^ 

NIIVS2 



t469 .aa 
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VFLPTIIATPIINFLFKINKSGLKKELPIDQNTHICVSFEYDNLAKILIWDFKNELRKEGFFTQQIKNDSSQYINA 
RKNNISFSIKREGSKITFECPNNHLIII^ 

KENIILELKSSNKADVIRELLSVINIEIDKERIFQDLMEREKLITTALKEGFAIPHLKTNLISKIHIAIGISHEGI 
DFNALDKNLS HVF I L I LC P AKDYVS Y PRI LAS WGKVDL YKKE I LNAKTDKE I YNI I VS Z 

f469.nt 

ATGGCAAATGTAGCATTATCTTCAGGATTTATTAGCCAAAAAATATTTGGAATCATAATAATAATGGTGTTTTTGC 

CAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAACTCCCAAT 

AGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTTTAAAAAT 

GAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGAAAAAACA 

ATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTATAATACA 

AGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAGAGCAAAA 

AAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAA 

TT ATTC T AGAATT AAAATC AAGC AAT AAGGC TG ATGT AAT AAG AGAGC TTC T AAGCGT AAT AAAC ATTG AAATTGA 

TAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTTTGCCATT 

CCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGACTTTAATG 

CTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGAtTAGGTTAGCTACCCTAGAAT 

TTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAACAGATAAAGAAATTTAT 

AATATAATAGTGAGCTAA ^ 
t469.nt 

TTTTTGCCAACAATCATTGCAACACCCATAATAAACTTTTTATTTAAAATAAATAAAAGTGGACTTAAAAAAGAAC 
TCCCAATAGATCAAAATACACACATATGCGTATCATTTGAATATGATAATTTAGCCAAAATTCTTATATGGGACTT 
TAAAAATGAGTTAAGAAAAGAAGGATTTTTTACACAACAAATTAAAAATGATTCTTCACAATATATTAATGCAAGA 
AAAAACAATATATCCTTCTCAATAAAACGAGAAGGTAGCAAAATCACATTTGAATGCCCAAATAATCATTTAATTA 
TAATACAAGATCTTTTTAGAGAAACAATCTTAAACCTAGAAAAAATAACCAAAGAAGTTGAAACAGTCTCTTTAAG 
AGCAAAAAAACTAGATTACTCAATAAATTACGATAAAATCCTTAGTAATATCAACCTAAATAAAAGAATAAAAAA 
GAAAACATTATTCTAGAATTAAAATCAAGCAATAAGGCTGATGTAATAAGAGAGCTTCTAAGCGTAATAAACATTG 
AAATTGATAAAGAAAGAATATTCCAAGATTTAATGGAAAGAGAAAAGTTAATTACTACTGCACTAAAAGAAGGCTT 
-TGCCATTCCCCATTTAAAAACAAATTTAATATCAAAAATACATATTGCAATAGGAATAAGCCATGAGGGAATTGAC 
TTTAATGCTCTTGACAAGAACTTAAGTCATGTTTTTATATTAATACTGTGCCCAGCAAAAGATTACGTTAGCTACC 
CTAGAATTTTAGCATCTGTTGTGGGCAAAGTTGATCTGTACAAAAAAGAAATTTTAAATGCAAAAA 
AATTTAT AATATAATAGTGAGCTAA < 

f477.aa. 

MEKPQGVSIVGAISGAMHVHLMAEHYGVPVVLHTDHCAKNLLPWEGLLEYGEKYYSQHKK 

KENIEISKKFLERMAKIEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVH 
GVYKPGNVKLTPKVLKDGQDYVISKTGVNMAKPV^ 

YYKKNESRLQGQLGDGKDIDIPNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRN2 
t477..aa 

MHVHLMAEHYGVPVVLHTDHCAKNLLPWVEGLLEYGE 

IEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVHGVYKPGN\^LTPKVLK 
DGQDYVISKTGVNMAKPVSYVFHGGSGSTIDEINEALSYGWKMNIDTDTQWAAWEGVLNYYKKNESR^ 
KDIDI PNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRNZ 

£477. nt 

ATGGAAAAACC AC AAGGAGTTTC AAT AGTTGGAGCTATTTCTGGTGCTATGCATGTTCATTTAATGGC AGAGC ATT 
. ATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGCTTCCTTGGGTTGAAGGCCTTTTAGAATA 
TGG AGAGAAAT ACT ATAGTC AGC AC AAAAAAC C ATT ATTTTCTTC AC AT ATGTT AGATTT ATC AGAAG AAC C T ATT 
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AAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAATGGCAAAAATTGAAATGTTTTTGGAAATAGAGCTTG 
GAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTTTGCATGAACTATTTTCTACTCCTGAGGA 
TATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCAGATTGCAGCAGCTTTTGGAAATGTTCAT 
GGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAAGATGGTCAAGATTATGTCATATCAAAAA 
CAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGTCTGGATCTACAATTGATGAGATTAATGA 
GGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACAGATACACAGTGGGCTGCCTGGGAGGGTGTTTTAAAT 
TATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGCAAGGATATTGATATTCCAAATAAGAAAT 
TTT ATG ATCC AAGGGTTTGGTT AAGAGAAGG TGAAGTTTCT ATGAAAGAC C GTGTG AAGATTGC ATGC AAAAATC T 
TAATAATATTAATAGAAATTAA 

t.477.nt 

ATGCATGTTCATTTAATGGCAGAGCATTATGGTGTTCCTGTTGTTCTTCATACTGATCACTGTGCTAAAAATTTGC 

TTCCTTGGGTTGAAGGCCTTTTAGAATATGGAGAGAAATACTATAGTCAGCACAAAAAACCATTATTTTCTTCACA 

TATGTTAGATTTATCAGAAGAACCTATTAAAGAAAATATTGAAATTTCTAAAAAATTCTTAGAAAGAAT 

ATTGAAATGTTTTTGGAAATAGAGCTTGGAATTACGGGTGGGGAAGAGGATGGAGTTGACAATTCAGATAGAGCTT 

TGCATGAACTATTTTCTACTCCTGAGGATATTTATTATGGATATTCAGAACTTTTAAAAGTTAGCCCAAATTTTCA 

GATTGCAGCAGCTTTTGGAAATGTTCATGGGGTATATAAACCGGGGAATGTTAAGCTTACTCCAAAAGTTTTAAAA 

GATGGTCAAGATTATGTCATATCAAAAACAGGAGTAAATATGGCTAAGCCAGTTTCTTATGTTTTTCATGGAGGGT . 

CTGGATCTACAATTGATGAGATTAATGAGGCGCTTTCTTATGGCGTTGTAAAGATGAATATTGACACAGATACACA 

GTGGGCTGCCTGGGAGGGTGTTTTAAATTATTACAAAAAAAATGAAAGTCGTTTGCAAGGTCAATTAGGAGATGGC 

AAGGATATTGATATTCCAAATAAGAAATTTTATGATCCAAGGGTTTGGTTAAGAGAAGCTGAAGTTTCTATGAAAG 

ACCGTGTGAAGATTGCATGCAAAAATCTTAATAATATTAATAGAAATTAA 

f 488 .aa 

MPSSFPFLLVNGSSGIAVGMATNMAPHNLREICDAIVYMLDNENASIFDLL^ 

TGKGSWIRARYHIEERAEDRNAIIVTEIPYTWKSALLMKVALLAKEEKLSGLLDIRDESDREGIRIVLEVKRGF 
DPKVIMNLLYEYTEFKKHFSINNLALVNGIPKQLNLEELLFEFIEH 

NNIDEVIKIIKSSKLAKDARERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVR 
IINIIREETINLGLKFGDERRTKIIYDEEVLKTSMSDLMQKENIVVMLTKKGFLKRLSQNEYKLQGTGGKGLSSFD 
LNDGDEIVIALCVNTHDYLFMISNEGKLYLINAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLL 
LTTASGKIARFESTDFKAVKSRGVIVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCG 
. MKLKEGDLFVKVLSVKENPYLLIVSENGYGKRLNMSKISELKRGATGYTSYKKSDKKAGSVVDAIAVSEDDEILLV 

S KR S KALRTVAGKVS EQGKD ARG IQ VL. F LDNDS LVS VS KF I KZ 
t488 .aa. ■ • 

MATNMAPHNLREICDAIVYMLDNENASIFDLLKIVKGPDF 

DRJNAIIVTEIPYTVNKSALLMKVALIxAKEE 

SINNLALWGIPKQLNLEELLFEFIEHRKNII^ 

RERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDY'EDILLNPVRIINIIREETINLGLKFGDE 
RRTKIIYDEEVLKTSMSDLMQKENIVVMLTKKGFLKRL^^ 

FMISNEGKLYLINAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLLLTTASGKIARFESTDFKAV 
KSRGVIVIKLNDKDFVTSAEIVFKDEK^ICLSKKGSAFIFNSRDVRLTNRGTQGVCGMKLKEGDLFVKVLSVKENP 
YLLIVSENGYGKRLNMSXISELKRGATGYTSYKKSDKKAGSVVDAIAVSEDDEILLVSKRS KALRTVAGKVS EQGK 

DARGIQVLFLDNDSLVSVSKFIKZ 
f483.nt 

ATGCCGTCATCATTTCCATTTCTTTTGGTAAATGGCTCTAGTGGAATTGCTGTTGGAATGGCTACTAATATGGCAC 
CTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATGCTTCTATATTTGATTTGCT 
T AAAATAGTTAAAGGGC CTGATTTC C C AACTTTTGGAG AGATTGTTT ATAATGATAATTTAATT AAAGC AT AC AAA 
ACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAAGATAGAAATGCTATAATTG 
TTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTTTAGCAAAAGAAGAAAAGCT 
AGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCTTGAAGTTAAAAGAGGATTT 
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GATCCTCATGTTATTATGAATTTGCTTTATGAATATACTGAATTTAAAAAGCATTTTAGTATAAATAATTTAGCCC 
TTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTGAGCATAGAAAAAATATTAT 
CGAJ^GACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGAGGGATTAAATATTGCTTTA 
AJ\TAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCAAGGGAGAGGCTTGTTTCGA 
ATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAACTTACAGCCGTTGAGATTTT 
TAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATATTCTCTTGAATCCAGTAAGG 
ATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAACGTCGAACTAAAATAATTT 
ATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTGTTGTTATGCTTACAAAGAA 
AGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGGAAAAGGACTAAGTTCGTTTGAT 
CTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTATTTATGATTTCAAATGAAG 
GAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAGGTCAGAATATTAGTGAGCT 
TATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAACTGATGATGCTTATTTATTG 
CTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGTAAAGTCACGAGGTGTTATTG 
TTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATGAAAAAGTAATTTGTCTTTC 
TAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTACTAATAGAGGTACCCAAGGTGTTTGTGGA 
ATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCTTATCTTTTGATTGTTTCTG 
AAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGAGGAGCCACTGGTTATACTAGTTA 
TAAJU^AATCTGATAAAAAAGCGGGTAGTGTTGCT 

AGTAAJVCGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAAGATGCTAGAGGAATTCAAG 
TATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 

t483 .nt • * " ' ' 

ATGGCTACTAATATGGCACCTCATAATTTAAGAGAAATTTGTGATGCCATTGTTTACATGCTAGATAATGAGAATG 
CTTCTATATTTGATTTGCTTAAAATAGTTAAAGGGCCTGATTTCCCAACTTTTGGAGAGATTGTTTATAATGATAA 
TTTAATTAAAGCATACAAAACTGGCAAGGGAAGTGTTGTTATTAGGGCAAGATATCATATTGAAGAAAGAGCAGAA 
GATAGAAATGCTATAATTGTTACAGAAATACCTTATACGGTAAATAAATCTGCACTTCTTATGAAAGTTGCGCTTT 
TAGCAAAAGAAGAAAAGCTAGAAGGACTTTTAGATATAAGAGATGAATCTGATCGAGAAGGTATTAGGATAGTTCT 
TGAAGTTAAAAGAGGATTTGATCCTCATGTTATTATGAATTTGCTTTAT.GAATATACTGAATTTAAAAAGCATTTT 
AGTATAAATAATTTAGCCCTTGTTAATGGTATTCCCAAACAGTTAAATTTAGAAGAATTGTTATTTGAATTTATTG 
AGCATAGAAAAAATATTATCGAAAGACGTATTGAATTTGACTTGAGAAAGGCAAAAGAGAAAGCACATGTTCTTGA 
GGGATTAAATATTGCTTTAAATAATATAGATGAGGTTATTAAGATTATTAAATCATCTAAATTAGCAAAAGATGCA 
AGGGAGAGGCTTGTTTCGAATTTTGGTCTTTCAGAGATTCAGGCCAATTCAGTTCTTGATATGAGGTTACAAAAAC 
TTACAGCCCTTGAGATTTTTAAGCTTGAAGAGGAGCTTAATATACTGTTAAGCTTAATAAAAGATTATGAAGATAT 
TCTCTTGAATCCAGTAAGGATTATTAATATTATAAGAGAAGAAACTATTAATTTAGGTTTGAAATTTGGCGATGAA 
CGTCGAACTAAAATAATTTATGATGAGGAGGTTTTAAAAACTAGTATGTCGGATTTAATGCAAAAAGAAAATATTG 
TTGTTATGCTTACAAAGAAAGGTTTCCTTAAAAGACTTTCACAAAATGAGTATAAATTGCAAGGTACGGGAGGAAA 
AGGACTAAGTTCGT-TTGATCTAAATGATGGAGATGAGATTGTTATTGCTTTGTGTGTCAATACTCATGATTATTTA 
TTTATGATTTCAAATGAAGGAAAGCTTTATTTAATCAATGCTTATGAAATAAAAGATTCTTCAAGAGCTTCAAAAG 
GTCAGAATATTAGTGAGCTTATTAATTTAGGAGATCAAGAAGAAATATTAACTATTAAGAATAGTAAAGATTTAAC 
TGATGATGCTTATTTATTGCTTACAACTGCAAGTGGAAAGATAGCTAGATTCGAATCTACAGATTTTAAAGCAGTA 
AAGTCACGAGGTGTTATTGTTATTAAACTGAATGATAAAGATTTTGTTACAAGTGCAGAGATTGTTTTTAAGGATG 
AAAAAGTAATTTGTCTTTCTAAAAAGGGTAGTGCATTTATATTTAATTCAAGGGATGTTAGGCTTACTAATAGAGG 
TACCCAAGGTGTTTGTGGAATGAAATTAAAAGAAGGTGATTTGTTTGTTAAAGTTTTATCGGTTAAAGAAAATCCT 
TATCTTTTGATTGTTTCTGAAAATGGGTATGGAAAAAGGTTAAACATGTCTAAAATATCTGAGCTTAAAAGAGGAG 
CCACTGGTTATACTAGTTATAAAAAATCTGATAAAAAAGCGGGTAGTGTTGTTGATGCTATAGCAGTTTCAGAGGA 
TGATGAAATCTTGCTTGTAAGTAAACGTTCAAAAGCTTTAAGAACAGTAGCTGGAAAAGTATCTGAACAAGGCAAA 
GATGCTAGAGGAATTCAAGTATTATTTCTTGATAATGACAGCTTGGTTTCTGTTTCAAAATTTATTAAATAA 

f494.aa . ' ' 

MFALIRKIFMIYFLCITLAGFAMIFIDSKFTEQPNVKENQSKINQKTIEPNLIMFTSSIGGFLGVWGIWIFNYD^ 
SNFYLNWGNLIILIYNIALIITVYSKSHS 



t494 ,aa 



TABLE 1. Nucleotide and Amino Acid Sequences 

MIFIDSKFTEQPNVKENQSKINQHTIEPNLIMFTSSIGGFLGVWGIWIFNYDKSNFYLNWGNLIILIYNIALIIT 
. VYSKSHS 

f494.nt ' 

ATGTTTGCATTAATTAGAAAAATATTTATGATCTATTTTTTATGCATTACTCTTGCAGGTTTTGCCATGATTTTTA 
TTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATACAATTGAACCCAA 
TTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATTTGGATCTTTAACTATGACAAA 
AGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACTGTATA.CTCAA 
AATCACATAGTTAG 

t494.nt " 

ATGATTTTTATTGACAGCAAATTTACCGAACAGCCTAATGTTAAAGAAAATCAAAGCAAAATTAATCAACATAC^ 
TTGAACCCAATTTAATCATGTTTACATCTTCTATAGGAGGATTTTTAGGTGTTTATGTTGGAATT-TGGATCTTTAA 
CTATGACAAAAGCAATTTTTACCTAAATTGGGGAAATTTAATAATATTAATATACAACATAGCCCTAATTATCACT- 

GTATACTCAAAATCACATAGTTAG ' „ 

f 516 .aa ' " ' ~ 

MKKTPNTCIFLTLLIISNLNALANSEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYY 
TFSNWTFDFLIVAMILTRESLNIPKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYE 
KLPYVIEPYMFIEISSKKAIPLMGLDFKXDFLFLDTFNISFNFTXRYNFKDKNEMET 

t516.aa 

NEEGNTNEKNDQPKQISNFFSPERGFIYSTGIGIGVGFFLNSNIKHLIFRPYYTFSNNTFDFLIVAMILTRESLNI 

■ PKKMQYFKSYIGGGINWHIANLIKKTKYFSATIGIGGRFYLSTNFIEDIRFYEKLPWIEPYMFIEISSKKAIPLM 
GLDFKIDFLFLDTFNISFNFTIRYNFKDKNEMET ■ 

■ f 51*6 .nt < 

ATGAAAAAAACTCCAAACACTTGTATTTTCTTAACATTGCTTATCATTTCCAATTTAAATGCACTTGCAAATGAAG 
AAGGCAATACTAATGAAAAAAATGATCAACCCAAACAAATCTCTAATTTTTTTAGCCCAGAAAGAGGGTTCATATA 
TTCAACAGGAATTGGGATTGGAGTTGGATTTTTTCTAAATTCAAATATTAAACACCTTATCTTTAGACCTTATTAT 
ACATTCTCTAATAATACTTTTGATTTTTTAATCGTTGCTATGATATTAACAAGGGAAAGCCTTAATATCCCCAAAA 
AAATGCAATACTTTAAATCTTATATTGGAGGAGGAATAAACTGGCACATTGCAAACTTAATTAAAAAAACAAAATA 
TTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACCTATCTACAAACTTTATAGAAGACATTCGATTTTACGAA 
AAATTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTTCTTCTAAAAAGGCAATTCCTTTAATGGGGTTAG 
ACTTTAAAATTGATTTTTTATTTTTAGATACATTTAACATTTCTTTTAATTTTACTATTAGATATAATTTTAAGGA 

CAAAAACGAGATGGAAACATGA 

t516.nt * 
AATGAAGAAGGCAATACTAATGAAAAAAAT 

TCATATATTCAACAGGAATTGGGATTGGAGTTGGATTTTTTCTAAATTCAAATATTAAACACCTTATCTTTAGACC 
TTATTATACATTCTCTAATAATACTTTTGATTTTTTAATCGTTGCTATGATATTAACAAGGGAAAGCCTTAATATC 

' CCCAAAAAAATGCAATACTTTAAATCTTATATTGGAGGAGGAA 
CAAAATATTTTTCCGCCACCATTGGCATAGGTGGTCGTTTTTACCTATCTACAAACTTTATAGAAGACATTCGATT 
TTACGAAAJ^TTGCCTTATGTAATAGAGCCTTATATGTTTATTGAAATTTCTTCTAAAAAGGCAATTCCTTTAATG 
GGGTTAGACTTTAAAATTGATTTTTTATTTTTAGATACATTTAACATTTCTTTTAATTTTACTATTAGATATAATT 

TTAAGGACAAAAACGAGATGGAAACA.TGA 

f517.aa - ■ * 
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mipwasggilialsiafvgigpdgpnfaekpfykqiadigsiafgmmlpvlagfiamaiadkpgltpglvggvms 
gnvl<agflgaifagflagyyarflarrsvpew 

snsetfgvlgkiflglvlg'smitvdmggpfnkvaflfgvglipqvpeimgmvaaaipvppmamglatflapklfen 
eekesgkiaflisfigisegaipfaasdpgrvipsiwggavssiiaaflgvanhaphggpivlpvidnkfgfiia 
i avgvavatal vi f lks lklkes e 

t517 .aa 

DKPGLTPGLVGGVMSGNVKAGFLGAIFAGFLAGYVARFLARRSVPEW 

GKFMGVLESGLKSLQSNSETFGVLGKIFLGLVLGSMITVDMGGPFNKVAFLFGVGLIPQVPEIMGMVAAAIPVPPM 
AMGLATFLAPKLFENEEKESGKIAFLI SFIGI SEGAI PFAASDPGRVI PS I WGGAVS S 1 I AAFLGVANHAPHGG P 
IVLPVIDNKFGFIIAIAVGVAVATALVIFLKSLKLKESE 

f517.nt ; 

ATGATTCCTGTTGTTGCAAGTGGAGGAATTTTAATTGCTCTTAGCATTGCTTTTGTTGGGATTGGACCTGATGGGC 
CTAATTTTGCTGAGCATCCATTTTATAAGCAGATTGCAGATATTGGTTCTATAGCTTTTGGGATGATGTTGCCCGT 
GCTTGCTGGTTTTATTGCAATGGCAATTGCTGATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCT 
GGGAATGTAAAAGCAGGTTTCTTGGGCGCAATATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAA 
GAAGATCTGTTCCTGAGTGGTTAAGACCTGTAATGCCTATATTTGTAATTCCGCTAATAAGCACCATTATTGTCGG 
CTTTTTTATGCTGTATTTTGGTGTTTATATTGGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAG 
AGTAATTCGGAAACTTTTGGCGTGTTGGGTAAAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATA 
TGGGCGGACCTTTTAATAAAGTGGCATTTCTTTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAAT- 
GGTAGCAGCAGCAATTCCTGTTCCTCCTATGGCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAAT 
GAAGAAAAAGAATCTGGTAAAATAGCCTTTTTAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTG 
CTAGTGATCCCGGACGGGTAATCCCTTCGATAGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGG 
C'GTTGCTAATCATGCTCCACACGGAGGACCAATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCA 
ATTGCTGTTGGAGTTGCGGTTGCAACAGCTTTGGTAATTTTTTTGAAATCTTTAAAATTAAAGGAATCTGAATGA 

t517.nt 

GATAAGCCTGGTCTTACCCCCGGTCTTGTTGGTGGAGTAATGTCTGGGAATGTAAAAGCAGGTTTCTTGGGCGCAA 
TATTTGCGGGCTTTCTTGCAGGTTATGTTGCAAGGTTTTTAGCAAGAAGATCTGTTCCTGAGTGGTTAAGACCTGT 
- AATGCCTATATTTGTAATTCCGCTAATAAGCACCATTATTGTCGGCTTTTTTATGCTGTATTTTGGTGTTTATATT 
GGAAAATTTATGGGGGTGCTTGAGAGTGGGCTTAAATCTTTACAGAGTAATTCGGAAACTTTTGGCGTGTTGGGTA 
AAATTTTCTTAGGCTTAGTACTAGGTTCAATGATTACTGTTGATATGGGCGGACCTTTTAATAAAGTGGCATTTCT 
TTTTGGTGTAGGGCTAATTCCTCAAGTGCCAGAAATAATGGGAATGGTAGCAGCAGCAATTCCTGTTCCTCCTATG 
GCTATGGGGCTTGCAACCTTTTTAGCACCTAAATTGTTTGAAAATGAAGAAAAAGAATCTGGTAAAATAGCCTTTT 
TAATTTCATTTATTGGTATTAGCGAAGGAGCTATTCCTTTTGCTGCTAGTGATCCCGGACGGGTAATCCCTTCGAT 
AGTGGTAGGGGGAGCTGTATCAAGCATTATTGCCGCTTTTTTAGGCGTTGCTAATCATGCTCCACACGGAGGACCA 
ATAGTACTTCCTGTTATTGATAATAAATTTGGGTTTATTATTGCAATTGCTGTTGGAGTTGCGGTTGCAACAGCTT 
TGGTAATTTTTTTGAAATCTTTAAAATTAAAGGAATCTGAATGA 

f519.aa ' * 

MIKIFKKIYILTLVLGMAHLSFASDNYMVRCSKEEDSTTCIAKLKEIKEKKI^DLFSMGIGIGDPIANIMITIPYi 
NIDFGYGGFIGLKSNNFENYLNGGIDVIFKXQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPF 
IMEYNFAKNLSIGFKIYPAVGPTILLTKPSILFEGIKFNFFGFGFIKFAFN 

t519.aa 

DNYMVRCSKEEDSTTCIAKLKEIKEKKNYDLFSMGIGIGDPIANIMITIPYINIDFGYGGFIGLKSNNFENYLNGG 
IDVIFKXQIGQYMKIGGGIGIGADWSKTSLIPPNEEEETDYERIGAVIRIPFIMEYNFAKNLSIGFKIYPAVGPTI 

LLTKPSILFEGIKFNFFGFGFIKFAFN 
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f519.nt 

ATGATAAAAATTTTTAAAAAAATATACATTTTAACATTAGTATTAGGTATGGCACACCTTTCTTTTGCATCTGACA 
ATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAAAGAA 
AAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTATATA 
AATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTATCTAAATGGTGGAATAG 
ACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGTCAAA 
AAC ATCCC TT AT AC C C C C T AATG AAG AAG AAG AAAC TG ATT ATG AG AG AAT AGGC GC TGTT AT AAG AATTC C TTTT 
ATAATGGAATATAATTTTGCAAAAAATTTATCCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATATTAC 
TAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCATTTAA 

TTAA 

t519.nt * . 

GACAATTATATGGTCAGATGCAGCAAGGAAGAAGATTCAACCACCTGTATCGCAAAGCTTAAAGAAATAAAAGAAA 
AGAAAAATTATGACTTATTTTCAATGGGCATTGGAATAGGAGATCCTATTGCAAATATTATGATTACAATTCCTTA 
TATAAATATTGATTTTGGATATGGAGGTTTTATTGGCCTTAAGTCAAACAATTTTGAAAATTATCTAAATGGTGGA 
ATAGACGTTATTTTTAAAAAGCAAATTGGACAATATATGAAAATTGGCGGCGGCATTGGAATAGGTGCGGATTGGT 
CAAAAACATCCCTTATACCCCCTAATGAAGAAGAAGAAACT^^ 

TTTTATAATGGAATATAATTTTGCAAAAAATTTATGCATAGGATTCAAAATTTATCCTGCAGTAGGGCCAACAATA 
TTACTAACAAAACCAAGCATTTTATTTGAAGGAATTAAATTCAATTTTTTTGGATTTGGATTCATAAAATTTGCAT 
TTAATTAA 

f52'0.aa" : 

MRMLLATI'ILILTTGLLAAQSKSKSMTEDDFDFDKLLAKEESVRRLFGIGFGVGYPLANITISVPYVDIDLGYGGF 
VGLKPNNFLPYVVMGVDLLFKDEIHKNTMISGG 

PLVIEYSFLKNIVIGFKAVATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 
t520 .aa 

QSKSKSMTEDDFDFDKLLAKEESVRRLFGIGFGVGYPI^^ITISVPYVDIDLGYGGFVGLKP^FNFLPYVVMGVDrJL < 
FKDEIHKNTMISGGIGIGADWSKGSPEKSNEKLEEEEENEAQQVASLQNRIGWIRLPLVIEYSFLKNIVIGFKAV 
ATIGTTMLLGSPMSFEGARFNFLGTGFIKIYI 

f520.nt 

' - - 

ATGAGAATGCTATTAGCAACAATAATACTTATATTAACAACGGGTTTATTAGCTGCACAATCCAAAAGCAAAAGTA 
-TGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTGCGCCGTTTATTTGGCATAGGTTT 

TGGAGTTGGATATCCACTTGCAAACATTACAATATCTGTTCCATATGTAGACATAGACCTTGGGTACGGAGGATTC 
'GTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTATTTAAAGATGAAATACACA 

AAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAG^ 

ACTTGAAGAAGAGGAAGAAAATGAAGCACAACAAGTAGCTTCTCTTCAAAATAGAATAGGGGTTGTGATAAGATTG 
CCTTTGGTAATAGAGTACAGCTTTCTTAAAAATATTGTGATTGGATTTAAAGCTGTTGCTACTATTGGAACAACTA 
TGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAGGCTTTATAAAAATATATAT 

ATAG ' 
t520.nt 

CAATCCAAAAGCAJ^AAGTATGACTGAAGATGACTTTGATTTTGATAAACTTCTTGCAAAAGAAGAGTCTGTGCGCC 

GTTTATTTGGCATAGGTTTTGGAGTTGGATATCCACTTGCAAACATTACAATATGTGTTCCATATGTAGACATAGA 

CCTTGGGTACGGAGGATTCGTAGGGCTTAAACCCAACAATTTCTTGCCCTATGTTGTGATGGGTGTAGATCTTCTA 

TTTAAAGATGAAATACACAAAAACACTATGATTTCTGGAGGCATTGGAATAGGTGCAGATTGGTCAAAAGGAAGTC 

CTGAAAAATCAAATGAAAAACTTGAAGAAGA.GGAAGAAAATGAAGCACAACAAGTAGCTTCTCTTCAAAA 

AGGGGTTGTGATAAGATTGCCTTTGGTAATAGAGTACAGCTTTCTTAAAAlATATTGTGATTGGATTTAAAGCTGTT 



92 

TABLE 1. Nucleotide and Amino Acid Sequences 



f52S.aa 

MKKEFIMLLLLLQTIMNLNSINTNTSTSIVKELQKNLYIFNSKEYQKDKDTLNEFINSININDKEILQSLEKIKNS 
LF 1 1 SVFFNNKKGI LI ALNLGAEINFKYKI SPI S I S I INNEFEITKI LI DYGI SLNQIDDTGYS P I FWAI YTNNEK 
.IFEFLKESGADLSFTLKNRKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

t526 . aa 

NSINTNTSTSIVKELQKNLYIFNSKEYQKDKDTLNEFINSININDKEILQSLEKIKNELFIISVFFNNKKGILIAL 
NLGAEINFKYKISPISISIINNEFEITKILIDYGISLNQIDDTGYSPIFWAIYTNNEKIFEFLKESGADLSFTLKN 
RKTPMQAAIETENIKLIKSLEKKKIYIDDNFKKKLKKLKNKEIVRILVK 

f526 /nt 

ATGAAAAAAGAATTCATTATGCTTTTACTGTTATTGCAAACAATAATGAATTTAAACTCAATAAATACTAATACAA 
GTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTCAATAG^ 

TTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAATCTTACAAAGTTTAGAAAAAATC 

CTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTAAATCTTGGAGCAGAAATAAAGT 

TTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACAATGAATTTGAAATCACAAAAATATTGATAGATTA 

CGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGCAATATATACTAATAACGAAAAA 

ATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAATAGAAAAACACCAATGCAAGCCG 

CAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAAAAAATTTACATTGACGAC 

AAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGAATTTTAGTAAAATAG 

t526.nt 

AACTCAATAAATACTAATACAAGTACTTCAATAGTAAAAGAATTGCAAAAAAATTTATATATTTTGAATAGCAAA^ 

AATATCAAAAAGATAAAGACACTTTAAATGAATTTATAAATTCAATAAATATAAATGACAAAGAAAT 

TTTAGAAAAAATCAAAAATGAGCTTTTTATAATATCTGTTTTTTTCAACAATAAAAAAGGGATTTTAATTGCACTA 

AATCTTGGAGCAGAAATAAACTTTAAATATAAAATATCTCCAATTTCAATTTCAATAATAAACA^ 

TCACAAAAATATTGATAGATTACGGAATAAGCCTTAATCAAATAGATGATACAGGTTATTCTCCAATATTTTGGGC 

AATATATACTAATAACGAAAAAATATTTGAATTTTTAAAAGAAAGCGGAGCTGATTTAAGTTTCACACTTAAAAAT 

AGAAAAACACCAATGCAAGCCGCAATAGAAACAGAAAATATAAAACTAATTAAATCTCTGGAAAAGAAA 

ACATTGACGACAATTTCAAAAAAAAACTTAAAAAGCTTAAAAACAAAGAAATAGTTCGA^ 

f544.aa 

MTKNRIIWLLVLMVSSTFTATIISNYQNLMLSLVV 

LKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPL 
ITT I ADAITL I AYFNI AKWVLVS YAV 

t544.aa / - 

S T FTAT 1 1 S NYQNLML S L WL ANF I P LL^ 

SVNFLRIVFFVAPHHSDKLKIAFVVSSCLMVSLTVAKILGGLLPIVAKLLKLDPALMAGPLITTIADAITLIAYFN 
IAKWVLVSYAV 

f544.nt ; 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTGTCTTCTACTTTTACAGCTACAATTATTTCAAATT 
ATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTCCCCTTTTAATGGATACTTCAGGCAATGCCGG 
CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGTTT 
TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTTTG 
TAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTAGC 
AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTTTA 
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.GCTACTATTGGAACAACTATGCTACTTGGCAGCCCAATGTCATTTGAAGGAGCTAGATTTAATTTCTTAGGCACAG 
GC TTT AT AAAAAT AT AT AT AT AG 

£523. aa ' 

j 

MNIKINFFFTLPIGIFLGLFFPLGIYSSLSHAFIRLSYLSLIPFLIFSIPLGIENIIENKNFKKLFGKTIYYGILT 
NLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFTSSNPNLLSIYMISIIIGTSFYYAK 
QKGRIARELMLSASNLFYHANGFIVNILNIGIIFI^ 

RLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSIIINIPLINFVSKFGTIFVSVISFFI 
ILKSYSSLPISIYEISYMSTLSFVFVFAFPHIPNSLIYIITMLCSTYTKGIELNVSNITPMLPILISLALLIDFAF 
NIAIIHIINFKELKDQEKIN 

t523 .aa . . ' 

IENIIENKNFKKLFGKTIYYGILTNLSGVAVSIIAATIYLPQRIPILEKTIQNTCFFEKEALLETFFPKNIFKIFT 
SSNPNLLSIYMISIIIGTSFYYAKQKGRIARELMLSASNLFYHANGFIWILNIGIIFITANYAANLKNFKDYPl^ 
TNSITFFI^WTIIILFVILPTISYRLTKSFKMIYKGIFVSFQNIIFSGLAKDSYSPYVILIEDIKNERINIKKSII 
INIPLINFVSKFGTIFVSVISFFIILKSYSSLPISIYEISYMSTLSFWVFAFPHIPNSLIYIITMLCSTYTKGIE 
LNVSNITPMLPILISLALLIDFAFNIAIIHIINFKELKDQEKIN 

f523.nt ' 

ATGAATATAAAAATCAATTTTTTTTTCACTTTGCCTATTGGAATCTTTTTAGGATTGTTTTTCCCTCTTGGAATTT 
ATAGCTCCTTATCACATGCTTTTATAAGATTATCATACTTATCTCTTATTCCCTTTTTAATATTTTCAATTCCATT 
AGGAATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAAAACAATTTATTATGGAATTTTAACT 
AACCTATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAA 
CAATACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAAAAAATATTTTCAAAATATT 
TACATCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAA 
CAAAAAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAA 
ACATATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAA^ 
TTATACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTAT 
AGATTAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTG 
CAAAAGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCAT 
AATTATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTATA 
ATTTTAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTG 
TCTTTGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAAT 
AGAGCTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTT 
AACATTGCAATCATTCATATAATAAACTTCAAAGAATTAAAAGATCAAGAAAAAATTAATTAA 

£523 .nt . " ■ , 

ATTGAAAATATTATTGAAAATAAAAACTTTAAAAAGCTTTTTGGTAJ^AACAATTTATTATGGAATTTTAACTAACC 
TATCTGGAGTTGCTGTATCAATAATAGCTGCAACAATATATCTTCCGCAAAGAATTCCAATACTAGAAAAAACAAT 
ACAAAATACATGTTTTTTTGAAAAAGAAGCTTTACTAGAAACATTCTTTCCAJ^AAAATATTTTCAAAATATTTACA 
TCTAGCAATCCAAATCTACTAAGCATTTACATGATTTCAATAATAATAGGCACAAGTTTTTATTATGCAAAACAAA 
AAGGCAGAATAGCTAGAGAACTGATGCTAAGCGCATCCAATCTTTTTTACCATGCAAATGGGTTTATTGTAAACAT 
ATTAAATATAGGGATCATTTTTATAACAGCAAATTACGCTGCAAACTTAAAAAACTTCAAAGATTACCCAAATTAT 
ACAAACAGCATAACATTCTTTTTGGCATGGACAATTATAATTTTATTCGTAATATTGCCAACAATTAGTTATAGAT 
TAACAAAAAGTTTTAAAATGATATATAAAGGCATTTTTGTATCATTTCAAAACATAATATTTTCAGGACTTGCAAA 
AGATTCTTATTCCCCTTATGTGATATTAATAGAAGATATTAAAAACGAAAGAATAAATATAAAAAAATCCATAATT 
ATAAACATACCTTTAATAAATTTTGTATCTAAATTTGGCACTATTTTTGTTTCAGTAATATCATTTTTTA.TAATTT 
TAAAATCATATTCTAGCTTACCCATTTCTATTTATGAAATAAGCTATATGAGCACTTTATCATTTGTTTTTGTCTT 
TGCATTTCCTCATATACCAAATAGTTTAATTTATATAATTACAATGCTTTGCTCTACATATACAAAAGGAATAGAG 
CTAAATGTTTCAAACATAACACCAATGCTGCCGATATTAATCTCTTTGGCTTTACTAATCGACTTTGCTTTTAACA 
TTGCAATCATTCATATAATAAACTTCAAAGAATTAAAJ^GATCAAGAAAJVAATTAATTAA 
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ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

't544.nt ■ 

TCTACTTTTACAGCTACAATTATTTCAAATTATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTC 
CCCTTTTAATGGATACTTCAGGCAATGCCGGCTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTAC 
TGTCAAGGTAAAAGATTTTTTTAAAGTGTTTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCT 
AGTGTTAATTTTTTAAGAATTGTCTTTTTTGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTT 
CATCTTGCTTGATGGTAAGTTTGACAGTAGCAAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAA 
. GTTGGATCCAGCACTTATGGCAGGCCCTTTAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAAT 
ATAGCAAAATGGGTTTTAGTTAGCTATGCTGTTTAA . 

f545.aa 

MTKNRIIWLLVLMVSSTF TATIIS^jfQNLMLSLVVLANFIPLLMDTSGNAGSQASALI IRELALGTVKVKDFFKVF 

LKEICVSILVGAILASVNFLRIVFFVAPHHSDKLKIAFWS^ 

ITTIADAITLIAYFNIAKWVLVSYAV 

• t545 .aa ■ ■ , . 

GSQASALIIRELALGTVKVKDFFKVFLKEICVSILVGAILASWFLRIVFFVAPHHSDKLKIAFWSSCLMVSLTV" 
' AKILGGLLPIVAKLLKLDPALMAG PL ITTIADAITLIAYFNIAKWVLVSYAV . - - 

f 545 .nt 

ATGACAAAAAATAGAATAATTTGGCTTTTAGTTCTTATGGTGTCTTCTACTTTTACAGCTACAATTATTTCAAATT 
ATCAAAATTTAATGTTGTCTTTAGTGGTTTTAGCTAATTTTATTCCCCTTTTAATGGATACT.TCAGGCAATGCCGG 
CTCTCAGGCATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGTTT 
TTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTTTG 
TAGCTCGACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTAGC 
AAAGATATTGGGAGGTCTTTTACCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTTTA 
ATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGCTG 
TTTAA 

t545.nt 

GGCTCTCAGG'CATCTGCGCTAATAATTCGTGAGCTTGCTCTTGGTACTGTCAAGGTAAAAGATTTTTTTAAAGTGT 
TTTTAAAGGAAATATGTGTTAGCATTCTAGTGGGAGCAATTCTTGCTAGTGTTAATTTTTTAAGAATTGTCTTTTT 
TGTAGCTCCACACCATTCTGATAAGCTGAAAATAGCTTTTGTAGTTTCATCTTGCTTGATGGTAAGTTTGACAGTA 
GCAAAGATATTGGGAGGTCTTTTA.CCCATTGTTGCTAAACTTTTAAAGTTGGATCCAGCACTTATGGCAGGCCCTT 
TAATCACTACAATTGCAGATGCTATTACTTTAATAGCTTATTTTAATATAGCAAAATGGGTTTTAGTTAGCTATGC 

TGTTTAA ■ - • 

f577.aa 

MRIKNLILIAILLISPSCSTNKNIVVLTDNKTIPFYINQFNIENKANFIIKFRNNiDLQTIEKENAQIIISKNIGN 
TNIANHFKSVKINYNPDYPILKHIFKQFKTK^ 

YVSENLFYVISQINNVRFSFEKNKLNYNENQILKMLEYFSSFLNTKQMDLQKDFFNKYGYLKLNKILLNKKSLLIA 
GLSDITFYNSLSEQEKSQIKFSYLINDNNEIVISNPNFIGILET5VLTKKFINWILYKKTQKTLIGFNNQSQSNIC 
FGFANGFTPYKELNLKIKKSIDGISPFIIDETQINSHSYVLSKKTIEKENLLINEWFFSKAKNLKKNKN 

t57 7 . aa * 

NKNI WLTDNKTI PFYINQFNIENKANF I IKFRNNIDLQTI EKENAQ 1 I ISKNIGNTNI ANHFKSVKINYNPDYP I 
LKHIFKQFNYKIIPLGFDIPILIYKNTHHIKKYINTKYLKEEYENFIKDGKFFISPYVSENLFYVISQINNVRFSF 
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EKNKLNYNENQILKMLEYFSSFLNTKQMDLQKDFFNKYGYLKLNKILLNKKSLLIAGLSDITFYNSLSEQEKSQIK 
FSYLINDNNEIVISNPNFIGILETSVLTKKFINWILYKKTQKTLIGFNNQSQSNICFGFANGFTPYKELNLKIKHS 
IDGISPFIIDETQINSHSYVLSKKTIEKENLLINEWFFSKANNLKKNKN * 

f577.nt ■ 

ATGAGAATAAAAAATTTAATACTAATAGCAATTTTATTAATTAGCCCTAGCTGTTCAACAAATAAGAACATCGTTG 
TACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAGCAAATTTTATAATTAA 
GTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTCTAAAAACATTGGTAAC 
ACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATCTTAAAGCATATTTTCA 
AGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAAATACACATCATATTAA 
AAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAAATTTTTTATATCGCCT 
TATGTTTCTGAAAATTTATTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTTGAAAAAAATAAATTAA 
ATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACAAAACAAATGGACTTGCA 
AAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAA^AATCTCTTTTAATAGCA 
GGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAATTTTCCTATTTAATAA 
ACGATAACAATGAAATTGTTATCTCAAACCCAAATTTTATTGGCATTTTAGAAACATCTGTTTTAACTAAAAAATT 
TATCAACTGGATATTGTATAAAAAAAGTCAAAAAACCCTAATTGGATTTAAGAATCAATCCCAATCAAATATATGT 
TTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCAATTGATGGAATATCTC 
CTTTTATTATTGACGAAACTCAAATCAATAGCCATTCCTATGTATTAAGCAAAAAAACAATTGAAAAAGAAAACTT 
ACTAATAAATGAATGGTTTTTCTCTAAAGCTAATAATCTAAAAAAAAATAAAAATTAA - 

t577.nt 

AATAAGAACATCGTTGTACTAACTGACAATAAAACAATACCATTTTATATAAATCAATTTAATATAGAAAATAAAG 
CAAATTTTATAATTAAGTTTAGAAATAATATTGATCTGCAAACAATAGAAAAAGAAAATGCACAAATAATTATTTC 
TAAAAACATTGGTAACACAAATATTGCTAACCATTTTAAATCTGTAAAAATCAATTATAATCCAGATTATCCTATC 
TTAAAGCATATTTTCAAGCAATTTAACTACAAAATTATTCCATTGGGCTTTGACATTCCTATTTTAATCTATAAAA 
ATACACATCATATTAAAAAATACATAAACACTAAATATCTAAAAGAAGAATACGAAAATTTCATTAAAGATGGAAA 
ATTTTTTATATCGCCTTATGTTTCTGAAAATTTATTTTATGTGATTTCTCAAATAAATAATGTGAGATTTTCTTTT 
GAAAAAAATAAATTAAATTATAATGAGAATCAAATTTTAAAAATGCTAGAATATTTCTCATCATTTTTAAATACA^ 
AACAAATGGACTTGCAAAAAGATTTCTTTAATAAATACGGCTACCTAAAGTTAAATAAAATATTGCTTAATAAAAA 
ATCTCTTTTAATAGCAGGATTGAGCGATATAACCTTCTACAATAGCTTAAGCGAACAAGAGAAGTCACAAATAAAA 
TTTTCCTATTTAATAAACGATAACAATGAAATTGTTATCTCAAACCGAAATTTTATTGGCATTTTAGAAACATCTG 
TTTTAACTAAAAAATTTATCAACTGGATATTGTAT^ 

CCAATCAAATATATGTTTTGGATTTGCCAATGGTTTTACCCCTTACAAAGAATTAAATTTAAAAATAAAACATTCA 
ATTGATGGAATATCTCCTTTTATTATTGACGAAACTCAAATCAATAGCCATTCGTATGTATTAAGCAAAAAAACAA 
TTGAAAAAGAAAACTTACTAATAAATGAATGGTTTTTCTCTAAAGCT 

f584'.aa ■ - 

MI KTI LLLVL YPVWF SQ I S ANQ YFEG I YAKYQNI EDMQATINFTLKGLKQTGVLL YKF PDKF I INLDSNNQVFVS 
DGEFLTVYVPSLGTSFNQQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYKGAATINS 
- . FIIAFAPDGIIRRITAFPTSGGREIVIDLTAVKFNVGILDSKFKYDPPKSSNKVDNFLYDIKKN 



t584.aa 

Q I S ANQYFEG I YAK YQNI EDMQAT INFTL KGLKQTGVLL YKF PDKF I INLDSNNQVFVSDGEFLTVYVPS LGTSFN 
QQLLKGSSGGGLMKVLNSEYSVSYTNSPNLEDLDSSEPGKYIKLTFSRKLYXGAATINSFIIAFAPDGIIRRITAF 
PTSGGREIVIDLTAVTCFNVGILDSKFKYDPPKSSNKVDNFLYDIKKN 

f584.nt _ 

ATGATAAAAACAATACTTTTATTAGTTTTGTATCCTGTTGTTGTGTTTTCTCAAATATCTGCAAATCAATATTTTG 
AAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTAATTTTACTTTA^GGGGTTAAAGCA 
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AACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGATTCAAATAATCAAGTTTTTGTAAGT 
GATGGTGAATTTTTGACAGTTTATGTTCCATCTCTTGGGACTTCTTTTAATCAGCAATTATTAAAGGGTAGTAGTG 
GGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCAATTCTCCAAATTTAGAAGATCTCGA 
TTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTACAAGGGGGCTGCTACTATTAATTCT 
TTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTTCCTACTAGTGGTGGGCGCGAAATAG 
TTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAATTTAAATATGATCCTCCAAAATCTTC 
AAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

t584 .nt 

CAAATATCTGCAAATCAATATTTTGAAGGAATTTATGCTAAATATCAAAATATAGAGGACATGCAAGCAACAATTA 
ATTTTACTTTAAAGGGGTTAAAGCAAACAGGTGTTTTGCTTTATAAGTTTCCAGACAAGTTTATTATCAATTTAGA 
TTCAAATAATCAAGTTTTTGTAAGTGATGGTGAATTTTTGACAGTTTATGTTCCATGTCTTGGGACTTCTTTTAAT 
CAGCAATTATTAAAGGGTAGTAGTGGGGGAGGTCTTATGAAAGTTTTAAATAGTGAGTATAGCGTATCTTATACCA 
ATTCTCCAAATTTAGAAGATCTCGATTCATCTGAGCCTGGAAAATATATTAAATTAACCTTTTCTAGAAAGCTTTA 
CAAGGGGGCTGCTACTATTAATTCTTTTATTATTGCTTTTGCTCCGGATGGAATAATTAGAAGAATTACTGCTTTT 
CCTACTAGTGGTGGGCGCGAAATAGTTATTGATTTGACTGCTGTGAAGTTTAATGTTGGAATTCTTGATAGCAAAT 
TTAAATATGATCCTCCAAAATCTTCAAATAAGGTAGATAATTTTTTATATGATATTAAAAAAAATTAA 

£596 .aa 

MKERCLYLLVFVALCVT^LFSDDYLIYDFDLSLNEFLEVSTRKDNLEPMVDSNRILLFYPPKKEIRKIFAAFDFDQ 
YSKKYLFKKNEHGVFFVKVNI PHGTSS IKYRL IVDGVWTNDEYNKNVVYNEDLI PFSKIEI AKEKSS YI SLRNP IQ 
SYDNNEIEIFYIGRPGQIVTIAGSFNNFNPFLNRLIEKEDNKGIYTIKLKNLPKDRIYYYFIDSGNKVIDKNNVNR 
INLYFVEGIDNKIDFEVSYFDHK 

t596.aa~ - * 

DDYL I YDFDLSLNEFLEVSTRKDNLEPMVDSNRILLFYPPKKE IRK I FAAFDFDQ YSKKYLFKKNEHGVFFVKVNI 
PHGTSSIKYRLIVDGVWTNDEYNKNVVYT^EDLIPFSKIEIAKEKSSYISLRNPIQSYDNNEIEIFYIGRPGQIVTI 
AGSFNNFNPFLNRLIEKEDNKGIYTIKLKJNLPKDRIYYYFIDSGNKVIDKNNVNRINLYFVEGIDNKIDFEVSYFD 
HK 

£596. nt - 

ATGAAAGAAAGGTGTTTGTATTTATTGGTTTTTGTAGCTTTATGTGTTAACAATCTTTTTTCAGATGATTATTTAA 
TTTATGACTTTGATTTAAGTTTAAATGAATTTCTAGAAGTTTCAACAAGAAAAGACAATCTTGAGCCTATGGTTGA 
TTCCAATCGTATATTATTGTTTTATCCTCCTAAAAAAGAAATTAGAAAAATTTTTGCTGCCTTTGACTTTGATCAG 
TATTCTAAGAAATATTTATTCAAAAAAAATGAGCATGGAGTTTTTTTTGTTAAAGTTAATATTCCTCATGGCACAA 
GCAGTATAAAATATAGGCTTATTGTAGACGGTGTTTGGACTAATGACGAGTATAATAAAAATGTAGTTTATAATGA 
GGATTTAATCCCATTTTCTAAAATTGAGATCGCTAAAGAGAA.GTCCAGCTATATTTCTTTGAGAAATCCAATACAA 
TCATATGATAACAATGAAATTGAAATTTTTTACATAGGTCGTCCTGGACAAATAGTTACAATAGCTGGTAGTTTTA 
ACAATTTTAATCCTTTTTT AAATAGGCTTATTGAGAAAGAGGACAATAAGGGAATTTATACTATTAAGCTTAAAAA 
TTTACCCAAGGATAGAATTTATTATTATTTTATTGATTCTGGTAACAAAGTAATAGATAAAAATAATGTTAATAGA 
ATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCGAAGTTTCCTATTTTGATCATAAGTAA 

t596 .nt 

GATGATTATTTAATTTATGACTTTGATTTAAGTTTAAATGAATTTCTAGAAGTTTCAACAAGAAAAGACAATCTTG 
AGC C TATGGTTGATTC C AATCGTAT ATTATTGTTTTATC C TC C TAAAAAAG AAATTAGAAAAATTTTTGCTGC C TT 
TGACTTTGATCAGTATTCTAAGAAATATTTATTCAAAAAAAATGAGCATGGAGTTTTTTTTGTTAAAGTTAATATT 
CCTCATGGCACAAGCAGTATAAAATATAGGCTTATTGTAGACGGTGTTTGGACTAATGACGAGTATAATAAAAATG 
TAGTTTATAATGAGGATTTAATCCCATTTTCTAAAATTGAGATCGCTAAAGAGAAGTCCAGCTATATTTCTTTGAG 
AAATCCAATACAATCATATGATAACAATGAAATTGAAATTTTTTACATAGGTCGTCCTGGACAAATAGTTACAATA 
GCTGGTAGTTTTAACAATTTTAATCCTTTTTTAAATAGGCTTATTGAGAAAGAGGACAATAAGGGAATTTATACTA 
TTAAGCTTAAAAATTTACCCAAGGATAGAATTTATTATTATTTTATTGATTCTGGTAACAAAGTAA.TA.GATAAAAA 
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TAATGTTAATAGAATTAATTTATATTTTGTTGAGGGAATTGATAATAAAATAGATTTCGAAGTTTCCTATTTTGAT 
CATAAGTAA 



f598.aa 

MRQRVMIAMALSCHPSLLIADEPTTALDVTIQEQILLLIK^^ 
EEGTVEEIFNNPKKPYTIGLLKSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

t598 .aa • 

EPTTALDVTIQEQILLLIKNLSKKFNTSTIFITHD^ 
KSILTLEHDPNKKLYSTKENPMKITKTSTEEF 

f598.nt ' - ' 

ATGAGACAAAGAGTTATGATTGCCATGGCTCTTAGCTGTCATCCATCCTTATTAATAGCAGATGAACCAACAACAG 
CCCTTGATGTTACAATCCAAGAGCAAATATTATTATTAATCAAAAACCTATCTAAAAAATTCAATACTTCTACCAT 
ATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATCTGTAATGTATCAAGGAAAAATTGTA 
GAAGAAGGAACAGTAGAGGAAATATTTAACAATCCTAAGCATCCTTACACCATTGGGCTTTTAAAATCAATTCTTA 
CGCTAGAACACGATCCAAATAAAAAGCTTTATTCAACAAAAGAAAACCCTATGAAGATCACAAAAACCAGCACCGA 

GGAGTTTTAA 

t598 .nt . 

GAACC^CAACAGCCCTTGATGTTACAATCCAAGAGCAAATAOT 

ATACTTCTACCATATTTATAACTCATGATCTTGCGGTTGTTGCTGAAATTTGTGATACAGTATGTGTAATGTATCA 
AGGAAAAATTGTAGAAGAAGGAACAGTAGAGGAAATATTTAACAATGCTAAGCATCCTTACACCATTGGGCTTTTA 
AAATCAATTCTTACGCTAGAACACGATCCAAATAAAAAGCTT 
AAAC C AGC AC CG AGG AGTTTT AA " * 

f 600 .aa 

MAIMERS I IGLF I ALAFVS WLTVAR WRGQVQSL S S S EF IQAAKTLGATNQRI I LKHL I PNS IGMI VI FTTIRVPS 
F IMAEAFLS FLGLGIS APMTSWGELVQNGIATFVEYPWKVFI PAIVMTI FLLFMNFLGDGLRDAFDPKDS I 



t600 .aa - • 

RVVRGQVQSLSSSEFIQAAKTLGATNQRIILKHLIPNSIGMIVIFTTIRVPSFIMAEAFLSFLGLGISAPMTSWGE 
L VQNG I AT F VE Y P WK VF I P A I VMT I F L L FMNF LGDG LRD AF D P KD S I 

f600.nt ' . . 

ATGGCAATAATGGAAAGAAGTATAATCGGCTTATTCATAGCACTTGCATTTGTATCATGGTTAACAGTAGCTCGAG 
TTGTACGAGGCCAAGTACAATCACTATCAAGTTCGGAATTTATACAAGCAGCCAAAAGCCTTGGTGCAACAAATCA 
AAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCCAAGC 
TTTATTAT(^CTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCAGCTCCAATGACAAGCTGGGGAGAATTAG 
TGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATATTTCT 
ATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

t600.nt * • 

CGAGTTGTACGAGGC C AAGTAC AATC AC T ATC AAGTTCGGAATTT AT AC AAGC AGC C AAAAC CC TTGGTGC AAC AA 
ATCAAAGAATAATCTTAAAACACTTGATCCCTAATAGCATTGGAATGATAGTTATATTCACAACAATAAGGGTTCC 
AAGC TTT ATT ATGGCTGAAGCATTTTTATCCTTTTTAGGACTTGGAATTTCAGCTCCAATGACAAGCTGGGGAGAA 




TABLE 1. Nucleotide and Amino Acid Sequences 

TTAGTGCAAAATGGAATTGCTACATTTGTTGAATATCCATGGAAAGTTTTTATTCCAGCTATAGTTATGACAATAT 
TTCTATTATTTATGAACTTTTTAGGTGATGGGCTAAGGGATGCTTTTGATCCAAAAGATAGCATCTAA 

f603.aa - 

MLKFTLKKILGIIPTLLVIIFLCFFVMRMAPGSPFDSEKPIDPQVKARLMEKYHLDKPFYIQAFYYITNALRGDLG 
PSLKKKDLTVSQYIKLGFPKSLTLGVISLIISLSIGIPIGILAAIYKNTYVDYIITSIAILGISIPLFVIGPILQY 
FFAIKWGLLYTSGWITERGGFSNLILPIITLSMPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIVIKHMLR 
GAMLPWSYIGPAFAAIISGSWIEKIFRIAGMGMFITESALNRDYPVLMGGLLVYSIILLISILISDIIYKILDP 
RV ■ . 

t603.aa 

. ) 
S PFDSEKP IDPQVKARLMEKYHLDKPF YIQAFYYITNALRGDLGPS LKKKDLTVSQYIKLGFPKSLTLGVI S LI I S 
LSIGIPIGILAAIYKNTYVDYIITSIAILGISIPLFVIGPILQYFFAIKWGLLYTSGWITERGGFSNLILPIITLS 
MPNVAIFARIIRGSMLEIIQSDFIRTARAKGLSFKKIV-IKHMLRGAMLPWSYIGPAFAAIISGSWIEKIFRIAG 
MGMFITESALNRDYPVLMGGLLVYSIILIjISILISDIIYKILDPRV 

f603.nt / 

ATGTTAAAGTTTACTTTAAAGAAAATATTAGGAATAATACCAACTTTACTGGTAATAATTTTTTTATGCTTTTTTG 
TAATGAGAATGGCTCCTGGAAGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGA 
AAAATATCACCTTGACAAGCCTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGA 
CCTTCTTTGAAAAAGAAAGACCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAG 
TAATATCCCTTATTATATCACTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGT 
GGATTATATAATAACATCAATAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATAT 
TTTTTTGGAATTAAATGGGGTTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTC 
TACCCATAATAACTCTTAGCATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAAT 
ACAAAGCGACTTTATAAGAACTGCGCGTGCAAAAGGGCTAAGCTTCAAAAAGATAGTTATAAAGCATATGTTAAGA 
GGAGCAATGTTGCCTGTAGTAAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAA 
AAATATTTAGAATTGCTGGAATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGG 
CGGATTGTTAGTATATTCAATAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCA 
AGAGTATAA , .. 

t603.nt 

AGTCCATTTGATTCTGAAAAACCTATTGATCCTCAAGTAAAAGCAAGATTGATGGAAAAATATCACCTTGACAAGC 
CTTTTTATATTCAAGCTTTTTATTACATTACAAACGCTCTCAGGGGAGATCTGGGACCTTCTTTGAAAAAGAAAGA 
CCTTACAGTTAGTCAATACATAAAATTAGGATTTCCAAAATCACTTACACTAGGAGTAATATCCCTTATTATATCA 
CTATCAATAGGAATACCAATAGGTATATTAGCTGCCATTTATAAAAATACTTATGTGGATTATATAATAACATCAA 
TAGCAATATTGGGGATTTCAATACCATTATTCGTAATAGGGCCAATTTTACAATATTTTTTTGCAATTAAATGGGG 
TTTGCTTTATACCTCTGGATGGATTACAGAAAGAGGAGGATTTTCAAATTTAATTCTACCCATAATAACTCTTAGC 
ATGCCCAACGTAGCTATTTTCGCAAGAATAATCAGAGGATCAATGCTAGAAATAATACAAAGCGACTTTATAAGAA 
C TGCGC GTGC AAAAGGGC T AAGC TTC AAAAAG AT AGTT AT AAAGC AT ATGTT AAG AGGAGC AATGTTGC C TGT AGT 
AAGCTATATAGGTCCAGCATTTGCTGCTATAATATCTGGAAGCGTGGTTATTGAAAAAATATTTAGAATTGCTGGA 
ATGGGAATGTTTATAACAGAATCCGCACTAAACAGAGATTACCCAGTATTAATGGGCGGATTGTTAGTATATTCAA 
TAATACTGCTTATTTCTATATTAATATCAGATATTATATATAAAATATTAGATCCAAGAGTATAA ' 

f607.aa 

MKYIKIALMLIIFSLIACISNAKKEXIVFRVSNLSEPSSLDPQLSTDLYGSNIITNLFLGLAVXDSQTGKYKPGLA 
KSWNISEDGIIYTFNLREDIVWSDGVAITAEEIKKSYLRILNKKTAAMYiySfljIKSTIKNAQEYFDETVPESELGIK 
AIDSKTLEITLTSPKPYFPDMLTHSAYIPVPMHIVEKYGENWTNPENIWSGAYKLKERSINPKIVIEKNEKYYNA 
KNVEIDEVIFYPTEGSVAYNMYINGELDFLQGAEKNNLEEIKIR^ 

SLAIDRETLTKVVLXGSSDPTRNLTPKFDDYSYGKNLILFDPENAKKLLAEAGYPDGKGFPTLKYKISEGRPTTAE 
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FLQEQFKKILNINLEIENEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALI 
KKSNFELDPIKKQDILRQAEEIIAEKDFPMAPLYIPKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 

t607.aa 

C I SNAKKEKI VFRVSNLSEPS SLDPQLSTDLYGSNI ITNLFLGLAVKDSQTGKYKPGLAKSWNISEDGI I YTFNLR 
EDIVWSDGVAITAEEIKKSYLRILNKKTAAMYANLIKSTIKNAQEYFDETVPESELGIKAIDSKTLEITLTSPKPY 
FPDMLTHSAYIPVPMHIVEKYGENWTNPENIVVSGAYKLKERSINDKIVIEKNEKYYNAKNVEIDEVIFYPTEGSV- 
AYNMYINGELDFLQGAEKNNLEEIKIRDDYYSGLK^ 

SDPTRNLTPKFDDYSYGKNLILFDPENAKKLLAEAGYPDGKGFPTLKYKISEGRPTTAEFLQEQFKKILNINLEIE 
.NEEWTTFLGSRRTGNYQMSSVGWIGDYFDPLTFLDSLFTTENHFLGAYKYSNKEYDALIKKSNFELDPIKRQDILR 
QAEEI IAEKDFPMAPLYI PKSHYLFRNDKWTGWVPNIAESYLYEDIKTKK 

' f 607 .n't , 

ATGAAATATATAAAAATAGCCTTAATGCTAATAATTTTTTCTTTAATAGCATGTATTAGTAATGCTAAAAAAGAAA 

AAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCCTCAACTCTCAACAGACCTTTACGGTAG 

CAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAACTGGAAAATATAAACCAGGACTTGCA 

AAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGAGAAGATATAGTTTGGAGCGATGGAG 

TTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTAAATAAAAAAACAGCTGCAATGTATGCTAA 

TTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACAGTGCCTGAATCTGAGCTTGGCATAAAG- 

GCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTATTTTCCTGATATGCTAACACACTCAG 

CATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGACAAATCCTGAAAATATAGTTGTTAG 

TGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGAAAAAAATGA 

AAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTGGCTTACAATATGTACATAAACGGTG 

AACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAATAAGAGATGATTATTATTCTGGGTT 

AAAAAACGGAATGGCATACATAGCATTCAATACAACAATAAAACCACTAGACAATTTAAAAGTTAGACAAGCCATC 

TCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGTTCAGATCCAACAAGAAATCTAACTC 

CAAAATTTGATGATTAT-TCTTATGGAAAAAATTTAATACTATTTGATCCTGAGAATGCAAAAAAACTTTTAGCTGA 

AGCTGGATATCCGGATGGGAAAGGATTCCCCACATTAA^TATAAAATATCGGAGGGAAGACCAACAACAGCAGAA 

TTTTTGCAAGAACAATTTAAAAAAATACTAAACATTAACTTAGAAATCGAGAATGAAGAATGGACAACATTCC 

GAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAGATTATTTTGATCCCTTAACATTCTT 

AGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGCGTACAAATATTCAAACAAAGAGTATGATGCTTTAATA 

AAAAAATCTAATTTTGAACTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCTGAAGAGATAATAGGAGAAA 

AAGACTTTCCTATGGCACCTTTATATATACCCAAATCTCATTATCTTTTCAGAAATGATAAATGGACAGGGTGGGT 

ACCAAATATCGCAGAAAGCTATTTATATGAAGATATTAAAACTAAAAAATAA 

t607 .nt 

TGTATTAGTAATGCTAAAAAAGAAAAAATAGTTTTCAGAGTATCAAACTTAAGCGAGCCATCATCACTTGATCCTC 
AACTCTCAACAGACCTTTACGGTAGCAACATTATTACAAACCTATTCTTAGGCCTAGCGGTAAAAGATTCTCAAAC. 
TGGAAAATATAAACCAGGACTTGCAAAAAGTTGGAATATTTCTGAAGATGGAATTATTTACACATTTAACCTAAGA 
GAAGATATAGTTTGGAGCGATGGAGTTGCCATTACTGCCGAGGAGATAAAAAAATCATACCTAAGAATTTTAAATA 
AAAAAACAGCTGCAATGTATGCTAATTTAATAAAATCTACAATAAAAAATGCACAAGAATATTTCGATGAGACAGT 
GCGTGAATCTGAGCTTGGCATAAAGGCTATTGACAGCAAAACCTTAGAGATAACATTAACATCTCCAAAGCCTTAT 
TTTCCTGATATGCTAACACACTCAGCATACATACCAGTTCCAATGCATATTGTTGAAAAATATGGAGAAAATTGGA 
CAAATCCTGAAAATATAGTTGTTAGTGGCGCATACAAACTTAAAGAAAGATCAATTAACGATAAAATCGTAATAGA 
AAAAAATGAAAAATACTATAATGCAAAAAATGTAGAAATTGATGAAGTAATATTTTACCCAACAGAAGGTAGCGTG 
GCTTACAJ^TATGTACATAAACGGTGAACTCGATTTTCTACAAGGAGCAGAAAAGAATAATTTAGAAGAAATTAAAA 
T AAGAGATG ATT ATTATTC TGGGTT AAAAAACGG AATGGC AT AC AT AGC ATTC AAT AC AAC AATAAAAC C AC T AGA 
CAATTTAAAAGTTAGACAAGCCATCTCCCTTGCCATTGACAGAGAAACTTTAACTAAAGTAGTTTTAAAGGGAAGT 
TCAGATCCAACAAGAAATCTAACTCCAAAATTTGATGATTATTCTTATGGAAAAAATTTAATACTATTTGATCCTG 
AG AATGC AAAAAAAC TTTT AGCTG AAGC TGGAT ATC CGGATGGG AAAGGATTC C C C AC ATT AAAATAT AAAATATC 
GGAGGGAAGACCAACAACAGCAGAATTTTTGCAAGAACAA^ 

AATGAAGAATGGACAACATTCCTAGGAAGCAGAAGAACTGGAAATTACCAAATGTCAAGCGTGGGGTGGATAGGAG 
ATTATTTTGATCCCTTAACATTCTTAGACAGCTTATTTACAACAGAAAATCATTTTTTAGGAGCGTACAAATATTC 
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AAAC AJ^AG A.GT ATG ATGC TTT AAT AAAAAAATC T AATTTTGAAC TTG ATC C AAT AAAAAGAC AAG AC ATTTT AAGA 
CAAGC TGAA.GA.GAZ AAT AGC AGAAAAAGAC TTTC C T ATGGC ACC TTTAT AT AT AC C C AAATCTC ATTATCTTTTC A 
GA—TGAZAJ^AT^^ 



^C<ZTZTZFZ3FYZFGFEDSSLKIGIDDV^ 
N!O^TGDSI?.ZI^:G?r/i:<2SfXELLSLTSSTPVPNKKFGELAF 
IP.S j;EKKY^OYLG?//QDNAYELLFKDDQNQGK^ 

OL- 7~L™/DV"T3 SM^S^TZ E ILKEHLFSIIE PQLQKFKS YRIGLVF YKDYLEDFLTKAFDFNT I P YLNNI LKYVNVGGG 
GDY?ZAvTZGXD.-_V/TQFDWRAEI^FIIVIGDAPPHEYPRGSIVYKDVINSAKEKDITIYGIIF^ 



FE3SSLKZGZDD\rr-/ZAJiEEGFHLFIRKKPAlksVILTESFEIPDKKKDVATYSFRTLSY 

KL'n^ZZLSZTSSTPYPNKXFG 

DNA//ZLLTKDDQKQGXIEFNELKDTF 

EIZ:<rHL-SZIE?QLQKFKSYRIGLVFYKDYLEDFLTKAFDFNTIPYLNNI^ 
Q 73 vr?AZ?_?_r Z IV Z GD A? P H E Y P RG S I VYKDV I NS AKE KD I T I YG 1 1 F Q 



ATGAAJSAAJ^JVTTTTTTTATTTCTTTTTATTAGTTTTTATTTGTTTC^ 
TTGIVTGATGTTTATGTTGATCCTC 

AAT AZTGAC AGAGTC TTTTGAAATTC C TGAT AAG AAAAAAGATGTGGC TACTTATTC ATTTCGTAC ATT AAGTTAT 
AAT AAGGT T AA.TGG AG 

ATGAAATTCGGATTTTAAATGGAAGAGTTATTAAGAATAAAGAACTTTTATCATTGACAT 
CTTCCA.CCCCTGTTCCTAATAAAAAGTTTGGAGAAGCTTTTCATATATTGATTCCAAAAAAATTAAAATATGGATT 

TCCAAJVITTTTCAACAAGAAGTGGTGATATTGACTTAGAA^^ 
AT^GATC TCTTC-AGAJ^AAAAT AT AATG ATT A 

ATGACCAAiLVTCAGGGAAAAATTGAATTTAATGAATTAAAAG ' 
TGCTAAJIAJVZGGCATTGATATTGTTGATAAAAT/yVACAAA^ 

GATTTAGTGC TTGTTGTTGATGTTACTGATAGTATGAAAAGCAATATTGAGATTCTAAAAGAGCATTTGTTTTCAA 
T AAT A.GAAC C TC A.-.C TTC AAAAGTTT AAATC C T AC AGAAT AGGTCTTGTTTTTT AT AAAGACT ATC TTGAAGATTT 
TTT AA.C C AAA.GC TTTTGATTTT AAT AC T ATTC CTTATTTAAAT AAT ATTC TTAAGT ATGTT AATGTTGGTGGC GGT 
C^GGAJTTATCCAGAA.GCTGTTTTTGAGGGGATTGATGCTGCTGTGACCCAATTTGATTGGCGGGCAGAAAGAAGGT 

TTATCATTGTTA 

TAGGAGATGCACCTCCTCATGAGTATCCAAGAGGGTCTATTGTTTATAAAGATGTTATCAATTC 
.TGC AAA.GGAAA AAG AT ATT AC AATTT ATGGAAT AAT ATTTC AGT AA 



q^TGAJlGATAGTTCTTTGAAAATAGGTATTGATGATGTTTATGTTGAGGCTCATGAAGAGGGATTTCATCTTTTTA 
TTAGAJULAAAA.CCTGCAATCAAATCAGTAATATTGACAGAGTCTTTTGAA 

TACTCATTCATTTCGTACATTAAGTTATAATAAGGTTAATGGAGATGAAATTCGGATTTTAAATGGAAGAGTTATT 
A A.GA.AT AAAGA A.C TTTT ATC ATTGAC ATGTTC C ACCC CTGTTC CTAATAAAAAGTTTGGAGAAGC TTTTC AT AT AT 
TGATTCCAAJ^AJLAATTAAAATATGGATTTCCAAATTTTTCAACAAGAAGTGG 

£ A.GTA-A AJVAlA.GAGC CC TTTTGGTTTTCT AT AAGATC TTTTG AG AAAAAAT AT AATGATTATTTGGGC AGAT ATC AA 

gacaa-tc<:ttatgaattgcttttcaaggatgatcaaaatcagggaaaaattgaatttaa 
ttacaj^j^/tt^tcagatgaggttgttattgctaataatgg 

aaj^ctcagaj^gaztcagtttatgatttagatttagtgcttgttgttgatgttactgatagtatgaaaagcaatatt 

gaga.ttc t aaaa.g agc atttgttttc aat aatagaac c tc aacttc aaaagtttaaatc c t ac agaat aggtc ttg 
ttttttataj^agactatcttgaagattttttaaccaaagcttttgattttaatactattccttatttaaataatat 
tctt aa-gt atgtt aatgttggtggc ggtggggatt atc c ag aagctgtttttg aggggattgatgctgctgtgac c 
caatttgattc<k:gggcagaaagaaggtttattattgttataggagatgcacctcctcatgagtatccaagagggt 
ctat'tgtttataaagajtgttatcaattctgcaaaggaaaaagatattacaa 
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f617.aa 

MIFFRNSFMALIFSFSILSISYFFGDFFQFSYIKMISWRFILFLIMATGIATCAKSNSLNLGNEGQIYFGAFLVYI 
FSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKALTC 

NSLFALDSSLIYLFLLGVSVWLFYVFIHKKTIYGLQLEILSNKKKIDIFFNINEFKYKFFAVFGSAFLNGLAGSMF 
WFFRPYLVLGLTSGLGWSSLIVAVISGFNYVYVLFFSLLFSILIEFNNFLNINYDFKYEFIGLCQSIAIFISLFL 
IKARKK 

t617.aa . ' 

AKSNSLNLGNEGQIYFGAFLVYIFSSFFGLTYFNFVFLILLSSFFVGLLGLIPFFITFFFGLNKALTGLLISYGNQ 
RLVDGF I LNMLKTGS FSNQTKRINS LF ALDS S L I YLFLLGVS VWLF YVF I HKKT I YGLQLE I LSNKKKIDI F FNIN 
EFKYKFFAVFGSAFLNGLAGSMFWFFRPYLVLGLTSGLGWS^^ 
NYDFKYEFIGLCQS.IAIFISLFLIKARKK- ■ ~ - 

f 617 .nt 

ATGATCTTTTTTAGAAATAGCTTTATGGCATTAATTTTTTCTTTTTCAATATTAAGTATTAGCTATTTTTTCGGTG 
ATTTTTTTCAATTTTCTTATATTAAAATGATATCTTGGCGCTTTATTTTATTTTTAATTATGGCTACGGGGATTGC 
TACTTGTGCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATA 
TTTTCAAGTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTT 
TGGGGCTTATCCCCTTTTTTATTACTTTTTTCTTCGGATTAAATAAAGCCTTAACAGGTCTTTTAATATCTTATGG 
AAATCAAAGATTGGTGGATGGATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATT 
AATAGTTTGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTT 
TTATTCACAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAA 
TATAAATGAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCTTGCAGGTTCTATGTTT 
GTAGTGTTTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAA 
TTTCAGGATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCT 
TAATATAAATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTG 
ATTAAAGCTAGGAAAAAGTAG 

t617 .nt ^ 

GCCAAGAGTAATTCATTAAATCTTGGGAATGAAGGTCAGATTTATTTTGGGGCATTTTTAGTTTATATATTTTCAA 
GTTTTTTTGGATTAACCTATTTTAATTTTGTATTTTTGATACTTTTAAGTTCTTTTTTTGTAGGACTTTTGGGGCT 
TATCCCCTTTTTTATTACTTTTTTCTTCGGATTAAATAAAGCCTTAACAGGTCTTTTAATATCTTATGGAAATCAA 
AGATTGGTGGATGGATTTATTTTAAATATGTTAAAAACAGGTAGTTTTTCTAATCAGACAAAAAGGATTAATAGTT 
TGTTTGCTTTAGATTCATCACTTATTTACTTGTTTTTGCTTGGTGTATCAGTTTGGCTTTTTTATGTTTTTATTCA 
CAAAAAAACTATTTATGGTCTTCAGCTTGAAATATTAAGCAATAAAAAAAAGATAGACATTTTTTTCAATATAAAT 
GAATTTAAATATAAGTTTTTCGCTGTATTTGGCAGTGCTTTTTTAAATGGTCTTGCAGGTTCTATGTTTGTAGTGT 
TTTTTAGACCATATTTGGTTTTAGGGCTAACTTCAGGACTTGGTTGGAGTAGTCTAATTGTTGCTGTAATTTCAGG 
ATTTAATTATGTTTATGTATTATTTTTTAGCTTATTGTTTTCAATATTAATTGAATTTAATAATTTTCTTAATATA 
AATTATGACTTTAAGTATGAATTTATTGGGCTTTGTCAATCAATTGCTATTTTTATCTCTTTATTTTTGATTAAAG 

C T AGG AAAAAGT AG - 
f 631 .aa 

MWEINSLRTCYLLVLLLLVAYGLVVFYTSSFFLSLELTGNPNFLFFTRLNYLFLSFMVFLVFERISLNFLKK3IF 

PVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISYWIKPMLIFAIFW 

VLIILQNDYSTAIYFAILFFIVLFVSNMAFSYVFAIVVTFLPVSAIFLMLEPYRVSRIFAFLNPYDDPSGK^ 

ASLNALKSGGILGKGLGMGEVKLGKLPEANSDFIFSVLGEELGFLGVLFAISLFFLFFYFGYFIAIHSNSRFKFFI 

AFISSI^IFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 



t631.aa 
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RISLNFLKKSIFPVLIITLFLIMATFLSPSISGAKRWIFFQGVSIQPSEIFKISFTIYLSAYLSKFDPRKNNGISY 
WIKPMLIFAIFWVLIILQNDYSTAIYFAILFFIVLFVSNMAFSYVFAIVVTFLPVSAIFLMLEPYRVSRIFA.FLNP 
YDDPSGKGYQIIASLNALKSGGILGKGLGMGEVKLGKLPEANSDFIFSVLGEELGFLGVLFAISLFFLFFYFGYFI 
AIHSNSRFKFFIAFISSLAIFLQSMMNILIAIGLLPPTGINLPFFSSGGSSIIVTMALSGLISNVSKNLSNN 

*f631.nt 

ATGGTTGTAGAGATAAATTCACTTAGGACATGTTATTTGCTTGTTTTGCTGCTATTGGTAGC.CTATGGCCTTGTAG 
TTTTTTATACTTCTTCCTTTTTTCTAAGCTTAGAATTGACAGGTAATCCAAATTTTTTATTTTTCACAAGACTTAA 
TTATCTTTTTTTAAGTTTTATGGTTTTTCTTGTTTTTGAAAGGATTTCTTTAAATTTTTTAAAAAAATCAATATTT 
CCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTTTTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGA 
TATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTTTAAAATATCTTTTACTATTTATCTTTCAGCTTATTT 
GAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATACTGGATAAAGCCAATGTTGATTTTTGCAATTTTTTGG 
GTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTTATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTT 
CTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTACTTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGA 
ACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCTTACGATGATCCTTCTGGCAAAGGTTACCAGATAATA 
• GCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTAAAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAAT 
TACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGGAGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTAT 
AAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATAGCTATTCATTCTAATAGTAGGTTTAAATTTTTTATT 
GCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGATGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTA 
CAGGGATAAATTTACCATTTTTTTCATCTGGGGGATCTTCTATTATTGTTACCATGGCATTGTCTGGCCTTATTTC 
AAATGTTTCAAAAAATTTAAGTAATAATTGA 

t631.nt 

AGGATTTCTTTAAATTTTTTAAAAAAATCAATATTTCCTGTATTGATTATAACTCTTTTTTTAATTATGGCAACTT 

TTTTATCTCCAAGTATTTCTGGAGCAAAGAGATGGATATTCTTTCAAGGTGTTAGCATTCAACCTTCTGAGATTTT 

TAAAATATCTTTTACTATTTATCTTTCAGCTTATTTGAGCAAGTTTGACCCAAGAAAAAACAATGGTATTTCATAC 

TGGATAAAGCCAATGTTGATTTTTGCAATTTTTTGGGTGTTAATAATTTTGCAAAACGATTATTCAACAGCTATTT 

ATTTTGCCATTCTTTTTTTTATTGTTTTGTTTGTTTCTAATATGGCATTTAGCTATGTTTTTGCTATTGTGGTTAC 

TTTTTTACCAGTTTCTGCTATATTCTTGATGCTTGAACCTTATAGGGTTTCTAGAATTTTTGCCTTTCTCAATCCT 

TACGATGATCCTTCTGGCAAAGGTTACCAGATAATAGCATCTCTTAATGCTTTAAAAAGTGGAGGAATTTTAGGTA 

AAGGGCTGGGAATGGGAGAGGTAAAACTTGGAAAATTACCAGAGGCCAATTCGGATTTTATTTTTTCAGTTCTTGG 

AGAAGAATTAGGATTTTTAGGGGTTTTGTTTGCTATAAGCTTGTTTTTTTTGTTTTTTTACTTTGGTTATTTTATA ■ 

GCTATTCATTCTAATAGTAGGTTTAAATTTTTTATTGCATTTATTTCAAGTCTTGCAATTTTTCTTCAAAGCATGA 

TGAATATTTTAATTGCAATCGGTCTTTTGCCTCCTACAGGGATAAATTTACGATTTTTTTCATCTGGGGGATCTTC 

TATTATTGTTACCATGGCATTGTCTGGCCTTATTTCAAATGTTTCAAAAAATTTAAGTAATAATTGA 

f 647 .aa 

MKVNNFLSFFFRAFFLLFLIVILFFFVLFFIDFIGMYNTKRYFPEFVRTKLLGETSLVFDHNSNIILDEARLVKER 
EAIDIKNQQIEKLKEDLKLKEDSLNKL^ 

RLEDLNPELAISYMRKIEELSKKEGRLSIVPYWLSLMDSKKAAILIRKMSVSSLE 
t647.aa . . 

IDFIGMYNTKRYFPEFVRTKIiLGETSLVFDHNSNIILDEARLVKEREAIDIKNQQXEKLKEDLKLKEDSLNKLEFE 
LKQKQKDLDLKQKIIDDIINKYNDEEANILQTAVYLMNMPPEDAVKRLEDLNPELAISYMRKIEELSKKEGRLSIV 

PYWLSLMDSKKAAILIRXMSVSSLE 
£647. nt 

ATGAAAGTGAATAATTTTTTATCGTTCTTTTTTAGGGCATTTTTTTTGTTATTTTTAATTGTTATTTTATTTTTCT 
TTGTATTATTCTTTATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATTTGTAAGAACCAAGTT 
GTTAGGAGAAACTTCTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATGAAGCTAGACTTGTGAAGGAAAGA 

gaagctattgatattaagaatcagcagattgaaaagcttaaagaagatctaaaGttaaaagaagacagtttaaata- 
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AGCTTGAATTTGAGCTTAAGCAAAAGCAGAAA^ 

ATATAATGATGAGGAAGCAAATATTTTGCAAACAGCTGTATATTTAATGAATATGCCACCAGAAGATGCTGTTAAG 
CGGCTTGAAGATTTAAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAJ^AAGAAGGTC 
GTTT ATC AATTGTTCC TT ATTGGTTATC TC TTATGGATTCTAAAAAAGC TGC T AT ATTGATTAG AAAAATGTC TGT 
TAGTTCATTGGAGTAG 

t647.nt _ 

ATTGATTTTATTGGAATGTATAATACTAAAAGATATTTCCCCGAATTTGTAAGAACCAAGTTGTTAGGAGAAACTT 
CTCTGGTCTTTGATCATAATTCTAATATAATTCTTGATGAAGCTAGACTTGTGAAGGAAAGAGAAGCTATTGATAT 
TAAGAATCAGCAGATTGAAAAGCTTAAAGAAGATCTAAAGTTAAAAGAAGACAGTTTAAATAAGCTTGAATTTGAG 
CTTAAGCAAAAGCAGAAAGATTTAGATTTAAAACAAAAAATAATAGATGACATTATAAATAAATATAATGATGAGG 
AAGCAAATATTTTGCAAACAGCTGTATATTTAATGAATATGCCACCAGAAGATGCTGTTAAGCGGCTTGAAGATTT 
AAATCCCGAGCTTGCAATATCTTATATGCGGAAAATTGAAGAGCTTTCCAAAAAAGAAGGTCGTTTATCAATTGTT 
CCTTATTGGTTATCTCTTATGGATTCTAAAAAAGCTGCTATATTGATTAGAAAAATGTCTGTTAGTTCATTGGAGT 

AG 

f653.aa 

MLTYGDMVTLLLVFFVTMFSLNDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQ 
TAKNKSMIEFIEKIQSKNIVVRQEERGIVISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGH . 

TDNIDTDVNGPWKSNWELSAARSVNMLEHILNYL 
TTDASLSFPKEIKQ 

t65 3.aa 

NDIIFQENVIRIMSASFTGAGFFKGGKTLDFSKLSYLSNSFMSLPSTVRNKQASQTAKNKSMIEFIEKIQSKNIVV 
RQEERGIVISLAADAFFDSASADVKLEENRDSIQKIASFIGFLSPRGYNFKIEGHTDNIDTDVNGPWKSNWELSAA 
RSVNMLEHILNYLDQSDVKRIENNFEVSGFGGSRPIATDDTPEGRAYNRRIDILITTDASLSFPKEIKQ 

f653.nt 

ATGTTGACTTATGGAGACATGGTTACTTTGCTGCTTGTGTTTTTTGTTACAATGTTTTCATTAAATGATATTATTT 
TTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGGTGGATTTTTCAAGGGCGGTAAAACTTTAGA 
TTTTAGTAAATTATCTTATTTGAGl'AATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAAA.CAAGCATCTCAG 
ACTGCTAAAAATAAATCCATGATTGAATTTATTGAGA 

GAGGTATTGTAATATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGCTTGAAGAGAATAG 
AGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAAAATAGAAGGGCAT 
ACAGATAATATTGATACTGATGTAAATGGACCTTGGAAAAGCAATTGGGAACTTTCGGCTGCTAGATCTGTTAATA 
TGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGAAAATAATTTTGAAGTATCTGGTTT 
TGGTGGAAGT AGGC C T ATTGC AAC AGACGAT AC C C C TGAGGGT AGGGC TT AT AAT AGAAGAATTGAT AT ATTAATT 
ACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

t653.nt 

AATGATATTATTTTTCAAGAAAATGTGATAAGAATAATGTCTGCTTCTTTCACGGGTGCTGGATTTTTCAAGGGCG 

GTAAAACTTTAGATTTTAGTAAATTATCTTATTTGAGTAATAGCTTTATGTCTTTGCCTTCTACTGTGCGCAATAA 

ACAAGCATCTCAGACTGCTAAAAATAAATCCATGATTGAATTTATTGAGAAGATTCAGTCTAAAAATATTGTAGTT . 

AGGC AAGAAGAAAGAGGTATTGT AAT ATCTCTTGCAGCAGATGCATTTTTTGATTCTGCTAGTGCAGATGTTAAGC 

TTGAAGAGAATAGAGATTCTATTCAAAAAATAGCATCTTTTATTGGCTTTTTAAGTCCTAGAGGCTATAATTTTAA 

AATAGAAGGGCATACAGATAATATTGATACTGATGTAAATGGACCTTGGAAAAGCAATTGGGAACTTTCGGCTGCT 

AGATCTGTTAATATGCTGGAACATATTTTGAACTATTTAGATCAATCTGATGTTAAAAGAATTGAAAATAATTTTG 

AAGTATCTGGTTTTGGTGGAAGTAGGCCTATTGCAACAGACGATACCCCTGAGGGTAGGGCTTATAATAGAAGAAT 

TGATATATTAATTACTACAGATGCATCTTTAAGTTTCCCTAAGGAAATTAAGCAGTAA 

f664.aa 
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MRMSVYTMGFAYIRSIMGYVVLFFFASLAVNFFVNIIQV^ 

FKSLLKWIICLIYYFIIENNIGKISKIjSEYTLQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKM 

TKEEVKQERKEMEGDPLLRSRIKERMRVILSTNLRVAIPQADVVIT^ 

LTIKKIARENJWPLMENKLLARALYANVK^ 

t664.aa 

FVNIIQVGFFITFKSLEPRWDKISFNFSRWAKNSFFSAGAFFNLFKSLLKWIICLIYYFIIENNIGKISKLSEYT 
LQSGISIVLVIAYKICFFSVMFLAIVGVFDYLFQRSQYIESLKMTKEEVKQERKEMEGDPLLRSRIKERMRVILST 
NLRVAIPQADWITNPEHFAVAIKWDSETMLAPKVL^^ 
EI PREYWEIVSKI LVRVYS ITKKFN 

- f664.nt " . 

ATGCGTATGAGTGTTTATACTATGGGTTTTGCATATATTAGATCTATCATGGGGTATGTCGTTTTGTTTTTTTTCG 
CATCTTTAGCTGTTAATTTTTTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAG 
GTGGGATAAAATTAGTTTTAATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTGAGCAGGGGCTTTTTTCAATTTG 
TTTAAAAGTTTGTTAAAAGTTGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTT 
CTAAGCTTTCGGAGTATACACTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTC- 
AGTAATGTTTT.TGGCAATTGTAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTGAAAATG 
ACAAAAGAAGAGGTAAAGCAGGAAAGAAAGGAAATGGAAGGTGATCCTTTACTTCGATCTAGAATAAAAGAGAGAA 
TGAGGGTTATTTTAAGTACCAATTTAAGAGTAGCTATTCCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTT 
TGCAGTTGCTATTAAATGGGATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCT 
CTCACAATTAAAAAAATTGCAAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATG 
CTAATGTTAAGGTTAATGAAGAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTC 
TATTACTAAAAAGTTTAATTAG 

t664.nt 

TTTGTTAATATTATTCAAGTAGGCTTTTTTATTACTTTTAAATCTTTGGAGCCAAGGTGGGATAAAATTAGTTTTA 
ATTTTTCCAGATGGGCAAAAAATTCTTTTTTTTCAGCAGGGGCTTTTTTCAATTTGTTTAAAAGTTTGTTAAAAGT 
TGTTATAATATGCTTGATATATTATTTTATTATAGAAAACAATATAGGCAAAATTTCTAAGCTTTCGGAGTATACA 
CTTCAATCTGGAATTTCTATTGTGTTAGTGATTGCCTATAAGATATGTTTTTTTTCAGTAATGTTTTTGGCAATTG 
TAGGGGTGTTTGATTATTTGTTTCAAAGATCTCAGTACATTGAGAGTTTGAAAATGACAAAAGAAGAGGTAAAGCA 
GGAAAGAAAGGAAATGG AAGGTGATC C TTT ACTTCGATC TAGAAT AAAAG AGAGAATGAGGGTT ATTTTAAGT AC C " 
. AATTTAAGAGTAGCTATTGCTCAAGCAGATGTAGTAATTACAAATCCAGAACATTTTGCAGTTGCTATTAAATGGG 
ATAGCGAAACAATGTTAGCTCCAAAGGTGCTTGCAAAAGGTCAAGATGAAATAGCTCTCACAATTAAAAAAATTGC 
AAGAGAAAATAATGTTCCTTTAATGGAAAATAAGCTCCTTGCAAGAGCTCTTTATGCTAATGTTAAGGTTAATGAA 
GAGATTCCAAGAGAATATTGGGAGATTGTTTCAAAAATTCTTGTGAGAGTATATTCTATTACTAAAAAGTTTAATT 
AG . - 

f680.aa 

MFTLSFVLINFIITGILILMLEFNFLKVDFKGNILXjAGIFMGLMQGLGAL PGISRSGITIFSASVIGFNRKSAFEI 
SFLSLIPIVFGAILLKHKEFYDIFMVLNFFEINLGALVAFWGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYF 
VRI 

t680.aa ^ 

ITGILILMLEFNFLKVDFKGNILLAGIFMGLMQGLGAiLPGISRSGITIFSASVIGFNRKSAFEISFLSLIPIVFGA 
ILLKHKEFYDIFMVLNFFEINLGAL.VAFWGIFSINFFFKMLNNKKLYYFSIYLFALSIIVCYFVRI 



f 680 .nt 
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ATTGTTTACATTGTGTTTCGTTTTAATTAATTTTATTATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTT 
TAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAGGAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCC 
AGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGCATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATT 
TCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCGATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGG 
TTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAGTTGCTTTTGTTGTTGGTATTTTCTCAATAAATTTCTTTTT 
TAAAATGCTTAATAACAAAAAACTGTATTATTTTTCTATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTT 

GTTAGAATATGA 
t680.nt 

ATAACAGGGATTTTAATCTTGATGCTAGAATTTAATTTTTTAAAAGTTGATTTTAAAGGTAATATTTTGTTAGCAG 

GAATTTTTATGGGGCTGATGCAAGGCTTGGGTGCGCTTCCAGGAATCTCTCGTTCAGGAATTACGATCTTTTCGGC 

ATCGGTTATTGGATTTAATAGAAAAAGTGCATTTGAAATTTCATTTTTATCTTTAATTCCAATAGTTTTTGGAGCG 

ATTTTATTAAAACATAAAGAATTTTATGATATTTTTATGGTTTTAAATTTTTTTGAAATAAACTTAGGAGCATTAG 

TTGGTTTTGTTGTTGGTATTTTCTCAATAAATTTCTTTTTTAAAATGCTTAATAACAAAAAACTGTATTATTTTTC . 

TATATATTTATTTGCACTTTCAATTATAGTTTGTTATTTTGTTAGAATATGA 

f 6 8 8 . aa ' 

MIVLLISIGCANAVHIINEIFKLIKKEQLSKESIKATIKKLKTPILLTSFTTAFGFLSLTTSSINAYKTMGIFMSI 
GVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIMVLIILGISFV 
GLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLDAFSAKTQSSS 
INGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRMYINDDWSLISIIVRIEDNSTEGIKKFEK 
YAINTINEYMKNNKYHFSGVYDKVLI AKTMVKEQVINI ITTLGS ITLLLMFFFKS IKTGI I I AI PVAWSVFLNFAV 
MRLFG ITLNP ATAT I AS VSMGVGVDYS I HFFNTF I LQ YQKNQ I YKTALLE S I PNVFNG I FANS I S VGIGFLTLTF S 
S YKI I STLGAI IAFTML.TTSLAS LTLLPLLI YLFKPRVKLASNNNFKKLKQZ 

t683.aa 

YKTMGIFMSIGVIISMIISLTVLPGIITLIPFAKKKSFEKEKENKLNKISFLERLAKLNTQITKSILKRKYTSSIM 
VLIILGISFVGLLKIEINFDEKDYFKESTSVKKTLNLMQKEMGGISIFKIEIEGRPGEFKNAKAMQILDLITDKLD 
AFSAKTQSSSINGILKFTNFKIKKESPLEYKLPENKIILNKLINLIDKSDWTKDNKRI^INDDWSLISIIVRIEDN 
STEGIKKFEKYAINT INEYMKNNKYHFSGVYDKVLI AKTMVKEQVINI ITTLGS ITLLLMFFFKSIKTG 
AWSVFLNFAVMRLtFGITLNPATATIASVSMGVGVDYS IHFFNTF ILQYQKNQI YKTALLES I PNVFNG I FANS I SV 
GIGFLTLTFSSYKIISTLGAIIAFTMLTTSLASLTLLPLLIYLFKPRVKLASNNNFKKLKQ2 

f688.nt • 

ATGATTGTTTTACTTATTTCAATCGGATGCGCCAATGCTGTACATATAATAAATGAAATATTTAAATTAATAAAAA 

AAGAACAGCTCTCAAAAGAATCCATAAAAGCAACAATTAAAAAACTTAAAACACCCATCCTGCTAACATCTTTTAC 

AACTGCATTTGGATTTTTATCTCTTACAACCTCTTCAATTAATGCCTACAAAACAATGGGTATTTTCATGTCAATT 

GGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAATAATAACATTAATCCCATTTGCAAAAAAAA 

AGTCTTTTGAAAAAGAAAAAGAAAATAAACTAAATAAAATATCCTTCCTTGAAAGACTTGCCAAACTAAATACGC 

AATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATGGTCCTCATCATACTGGGAATTTCTTTTGTA 

GGTCTTTTAAAAATCGAAATCAATTTTGATGAAAAAGATTACTTTAAAGAAAGCACAAGTGTAAAAAAAACATT^ 

ACCTAATGCAAAAAGAAATGGGGGGAATATCGATTTTCAAAATAGAJVATTGAAGGCAGGCCCGGTGAATTTAAAAA 

TGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGATGCATTTTCTGCAAAAACTCAATCTAGTTCT 

ATTAATGGCATTTTAAAATTTACAAATTTTAAi^TTAAAAAAGAATCCCCACTAGAGTATAAACTGCCTGAAA 

AAATTATACTAAACAAACTAATAAATTTGATAGATAAAAGCGATTGGACTAAGGACAATAAAAGAATGTACAT 

CGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAACTCAACCGAAGGAATAAAAAAATTTGAAAAA 

TATGCTATTAACACAATTAATGAATATATGAAAAATAATAAATATCATTTCTCAGGTGTTTATGATAAGGTATTAA 

TAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAACT.CTTGGATCAATAACACTACTACTTATGTT 

TTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTAGCATGGTCAGTGTTTTTAAACTTTGCTGTA 

ATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTGCATCTGTAAGCATGGGAGTAGGAGTAGATT 

ATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAATCAAATCTACAJ^JVCTGCACTTCTTGAATC 

AATACCCAATGTATTTAATGGAATATTTGCAAATTCTATTTCTGTTGGAATAGGATTTTTAACTCTAACATTTTCG 
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TCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAATGCTAACGACATCTCTTGCATCACTAACTC 
TTC TTC C ATTATTAATTTATTT ATTTAAAC C T AG AGT AAAGC T AGC C TC AAAC AAC AATTTT AAAAAATTAAAAC A 
ATAA 

t688.nt 

TACAAAACAATGGGTATTTTCATGTCAATTGGAGTAATTATCTCAATGATAATCTCATTAACCGTTTTACCTGGAA 
TAATAACATTAATCCCATTTGCAAAAAAAAAGTCTTTTGAA 

CCTTGAAAGACTTGCCAAACTAAATACGCAAATAACAAAATCTATATTAAAAAGAAAATATACATCCTCTATAATG 

GTCCTCATCATACTGGGAATTTCTTTTGTAGGTCTTTTAAAAATGGAAATCAATTTTGATGAAAAAGATTACTTTA 

AAGAAAGCACAAGTGTAAAAAAAACATTAAACCTAATGCAAAAAGAAATGGGGGGAATATCGAT 

AATTGAAGGCAGGCCCGGTGAATTTAAAAATGCTAAAGCAATGCAAATATTAGACTTAATTACAGATAAGCTTGAT 

GCATTTTCTGCAAAAACTCAATCTAGTTCTATTAATGGCATTTTAAAATTTACAAATTTTAAAATTAAAAAAGAAT 

CCCCACTAGAGTATAAACTGCCTGAAAATAAAATTATACTAAACAAACTAAT^ 

GACTAAGGACAATAAAAGAATGTACATTAACGATGACTGGTCATTAATATCTATCATAGTAAGAATTGAAGACAAC 

TCAACCGAAGGAATAAAAAAATTTGAAAAATATGCTATTAACACAATTAATGAATATATGAAAAA 

ATTTCTCAGGTGTTTATGATAAGGTATTAATAGCTAAAACAATGGTAAAAGAACAGGTTATAAACATTATAACAAC 

TCTTGGATCAATAACACTACTACTTATGTTTTTCTTTAAATCTATAAAAACCGGAATAATTATTGCAATCCCAGTA 

GCATGGTCAGTGTTTTTAAACTTTGCTGTAATGAGATTATTTGGGATAACCTTAAACCCCGCAACGGCAACAATTG 

CATCTGTAAGCATGGGAGTAGGAGTAGATTATTCAATTCATTTTTTCAATACATTTATTTTACAATACCAAAAAAA 

TCAAATCTACAAAACTGCACTTCTTGAATCAATACCCAATGTATTTJyVTGGAATATTTGCAAATTCTATTTCTGTT 

GGAATAGGATTTTTAACTCTAACATTTTCGTCTTATAAAATAATATCAACTCTTGGAGCAATAATTGCTTTTACAA 

TCCTAACGACATCTCTTGCATCACTAACTCTTCTTCCATTATTAATTTATTTATTTAAACCTAGAGTAAAGCTAGC 

CTCAAACAACAATTTTAAAAAATTAAAACAATAA '- . 

£704. aa ■ . 

M^TKFQEFISEFLGTFILLALGT^ 

AVSIGLASVGKFPVSKLLHYIVAQILGAFTGALMTLWFYPKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFG 
TFLLMFLISVVGDFTKKHSDNPFIPFIVGAVVLSIGISFGGMNGYAINPARDLGPRILLLFAGFKNHGFNNLSIVI 
VPIIGPIIGAILGATIYEFTLKNNKD ' 

t704 . aa 

GEIIKGGYTNIVFGWGLGVTFGIYTAARMSGAHLNPAVSIGLA 

PKWIEMDPGLENTQGIMATFPAVPGFLPGFIDQIFGTFLLMFLISWGDFTKKHSDNPFIPFIVGAVVLSIGISFG 
• GMNGYAINPARDLGPRI LLLFAGFKNHGFNNLS I VI VP I IGPIIGAI LGAT I YEFTLKNNKD 

• • f704 .nt 

ATGAATTATACAAAATTCCAAGAATTTATATCGGAATTTTTGGGAACATTTATCCTATTGGCTCTAGGAACTGGAT 
CTGTTGCAATGACAGTATTATTTTCCTCAAGTCCCGAAATACCAGGAGAAATAATAAAAGGAGGATATACAAATAT 
AGTATTTGGATGGGGATTGGGTGTAACGTTTGGTATTTACACAGCAGCAAGAATGAGCGGAGCACACCTAAACCCA 
GCTGTTAGCATAGGATTAGCAA.GTGTTGGAAAGTTTCCCGTTTCAAAACTTTTACATTACATTGTAGCACAAATAT 
. TAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTATCCTAAATGGATAGAAATGGATCCTGGCTTAGA 
AAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGATTTTTGCCTGGATTTATTGATCAAATTTTTGGA 
ACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGG^GATTTTACAAAAAAACACAGCGACAATCCATTTATTCCTT 
TTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCGGAGGAATGAACGGTTATGCTATTAATCCTGCAAG 
GG ATCTGGG AC CAAGAATTTTACTCTTATTTGC TGGATTTAAAAATCACGGATTTAACAATCTAAGT AT AGTT ATT 
GTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTACAATTTACGAATTTACACTAAAAAATAACAAAG 
ACTAA < 

t704.nt 

GGAGAAATAATAAAAGGAGGATATACAJVATATAGTATTTGGA.TGGGGATTGGGTGTAACGTTTGGTATTTACACAG 
CAGCAAGAATGAGCGGAGCACACCTAAACCCAGCTGTTAGCATAGGATTAGCAAGTGTTGGAAAGTTTCCCGTTTC 
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AAAACTTTTACATTACATTGTAGCACAAATATTAGGAGCTTTTACAGGTGCATTAATGACACTTGTCGTATTTTAT 
CCTAAATGGATAGAAATGGATCCTGGCTTAGAAAATACTCAAGGAATAATGGCAACTTTCCCTGCTGTTCCTGGAT 
TTTTGCCTGGATTTATTGATCAAATTTTTGGAACTTTTTTGCTAATGTTTTTAATTTCTGTTGTTGGAGATTTTAC 
AAAAAAACACAGCGACAATCCATTTATTCCTTTTATTGTAGGAGCAGTGGTTTTATCAATAGGGATAAGTTTCGGA 
GGAATGAACGGTTATGCTATTAATCCTGCAAGGGATCTGGGACCAAGAATTTTACTCTTATTTGCTGGATTTAAAA 
ATCACGGATTTAACAATCTAAGTATAGTTATTGTACCAATAATTGGCCCAATAATTGGAGCAATTTTGGGAGCTAC 
AATTTACGAATTTACACTAAAAAATAACAAAG 

ACTAA ' • 

f707.aa 

.MRRLFLLYILCSFWLNLFAQGSSSYIDKQKELAIFYYE^ 
VXELDARIKDDNPKWMLEDIKLEEIPGira 

FDLFDSVIADKVNVMGQFESKNDFISTLSSASSKADADELEYLSV^ 
KNF PFWKERQTLI FTTEDDNNWFLS S INZ 

t7 07.aa ' 

MRRLFLLYILCSFVFLNLFAQGSSSYIDKQKELAIFYYEVGQRYINVGKIKKGKLFQAKALKIYPDLKKGFDIKLA 
VKELDARIKDDNPKVVMLEDIKLEEIPGIVHEKIEINDFTNAPKIEYIAQRERSKNQDKIIKFQFGKFARALISRN 
FDLFDSVIADKWVMGQFESKNDFISTLSSASSK^ 

KNFPFWKERQTLIFTTEDDNNWFLSSINZ ' ^ 

f707 .nt . .. " 

.ATGAGAAGATTATTTCTTCTATATATTTTATGTTCTTTTGTTTTTTTGAATTTATTTGCTCAAGGTAGTTCTTCTT 
ATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAAACGTTGGTAAAATTAA 
AAAAGGAAAGCTTTTTCAAGCAAAAGCTTTAAAGATTTATCCAGATTTGAAAAAGGGGtTTGATATCAAGCTTGCA 
GTTAAAGAGCTTGATGCTAGGATTAAAGATGAGAATCCCAAGGTTGTTATGCTTGAGGATATTAAGCTTGAGGAGA 
TACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATG^ 

AGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGCTTTAATTTCTAGGAAC 
TTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCAAAAAATGATTTTATAT ' 
CAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTGATGATTATTACGATTT 
AAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTGCTGTTAATGTTAATGCCAAAAAAAATGATGTTAC^ 
AAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAATAATTGGTTTTTGTCTT 

CCATAAATTGA 

t707 .nt ^ * 

CAAGGTAGTTCTTCTTATATTGATAAGCAAAAAGAGCTTGCTATTTTTTATTATGAGGTTGGTCAAAGATATATAA 
ACGTTGGTAAAATTAAAAAAGGAAAGCTTTTTCAAGCAAA^ 

TGATATCAAGCTTGCAGTTAAAGAGCTTGATGCTAGGATTAAAGATGACAATCCCAAGGTTGTTATGCTTGAGGAT 
ATTAAGCTTGAGGAGATACCTGGAATAGTGCACGAAAAAATAGAAATCAATGATTTTACAAATGCTCCTAAAATAG 
AATATATTGCTCAAAGAGAGAGAAGCAAAAATCAAGATAAAATTATTAAGTTTCAATTTGGAAAGTTTGCAAGAGC 
TTTAATTTCTAGGAACTTTGATTTGTTTGATTCAGTTATTGCGGATAAAGTTAACGTTATGGGTCAATTTGAATCA 
AAAAATGATTTTATATCAACTTTATCAAGTGCTTCATCTAAGGCCGATGCTGATGAGTTAGAGTATTTATCAGTTG 
ATGATTATTACGATTTAAAGTCTTTAAAAATTTCAAAATCCAACGATACTTCTTTTGCTGTTAATGTTAATGCCAA 
AAAAAATGATGTTACTAAAAATTTTCCATTTTGGAAAGAACGTCAAACTTTAATTTTTACTACAGAGGATGATAAT 
AATTGGTTTTTGTCTTCCATAAATTGA 

f709.aa - 

MLIFGFIGLFFI^IFSLKAQGIVTNKDAQEEFKW^^ 

YVEEALMEWRNLKDQGYKVPYLRHLISTIEQRRGIFSNYEL^ 

YAAJtfFVGNEILYFDVNNITV/NALVKDGFSYLKSPYDV^ 

LLAPQYMAIDKRNYIYVSEWGNKRVSKFGLEGDFILHFGSRTSGYKGLLGP^^ 
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SGNHLYSVFTSIEGIEGLSSDFVGNNVIVSSKDGVYKYSIAKKTITKILKADKMNSKXSSSILDANNQMIVSDFNN 
AKVSVYKSDASLYDSLNVDVRRXIRLGGPKIYVELNVSSKSGLPVVGLKSE 

INXAWFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFINATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVS 
LKLAGSGLMSKSSRRAWYFSGGILNRKAFEKY 

FSSYEGVSKVYDLILEQKTGTYLLEYYYPGPQEPNKYFNLSVEANINQQTGRGEFAYFIN 
t709.aa 

QGIVTNKDAQEEFKWALNSYNNGIYDDALL^ 

PYLRHLISTIEQRRGIFSNYELNFKKLVKYASLDNSIYKRPHGYQITSLRADKYGGYYAANFVGNEILYFDVNNNV 
NALVKDGFSYLKSPYDVIEAT^LLYVTLYSSDEIGVYDKVL 

wgnkrvskfglegdfilhfgsrtsgykgllgptgvtylneniyvadslrntievfdtsgnhlysvftsiegiegls 
sdfvgnnvivsskdgvykysxakktitkxlkadkmns^^ 

vrriirlggpkiyvelnvssksglpvvglksenfsisnenyyivnpkvaynvnaskdiniavv 
qivglnalmelsknknfsfinatsvpixdniesltnsirntsslgpystdavktdvslklagsglmskssrravvy 
fsggilnrkafekysldtivsyyknndirfylilfgndpinsklqylvnetggavipfssyegvskvydlileqkt 
gtylleyyy'pgpqepnkyfnlsveaninqqtgrgefayfin 

f709.nt . 

atgttaatttttggttttattggtttgttttttttaaatatttttagtttgcatgcccaaggaatagttactaata 

aagatgctcaagaagagtttaaatgggctcttaattcttataataatggaatttacgatgatgctcttttatcttt 

taaaaaaattttaagctttgatcctaataatcttgattatcatttttggactggcaatgtttattatagactgggt 

tatgttgaagaagctttaatggaatggagaaatttaaaagatcaaggctataaggttccctatcttagacatttga 

tttctactattgagcaaaggagaggtattttttcaaattatgaacttaattttaaaaaacttgtaaaagttgcttc 

tcttgataattctatttataaaaggccacatgggtaccagattacatctttaagggctgataagtacggcggatat 

tacgctgctaactttgtaggcaatgaaatattgtattttgatgttaataacaatgttaatgctttagttaaagatg' 

gctttagttatttaaaatcaccttatgatgttattgaagctaataatctgctttatgtgactctttattcaagtga 

tgaaattggtgtttatgacaaagttcttggagttaaaaggaaatctattgggaataaaggcacaaaagatggcgaa 

ttgcttgctcctcagtatatggctattgataagagaaactatatttatgtaagtgagtggggaaataaaagagtaa 

gtaaatttggacttgaaggtgattttattttgcattttggttctagaagttcaggctataagggccttttaggtcc 

cacaggcgttacttatttgaatgaaaacatttatgttgcagattctctgagaaataccattgaagtttttgatact 

agtggtaatcatttatattcagtttttacttctattgagggaatagaggggcttagcagtgattttgtaggtaata 

atgttatagtatcctcaaaagatggtgtttataaatatagcattgctaaaaagacaattacaaaaattttaaaag 

agataaaatgaattctaaaatttcttcatctattttggatgccaataatcagatgattgtctcagattttaataat 

gccaaggtttcagtttacaagagtgatgcaagcctttatgatagtttaaatgttgatgttagaagaataattaggc 

ttggagggcctaaaatttacgttgagcttaatgttagcagtaaaagcggattaccagttgttgggcttaaaagtga 

aaatttttcaatttcaaatgaaaattattacattgtcaatcccaaggtggcatataatgtaaatggttcaai^gac 

attaatatagcagttgtttttgataaatcttcttatatgaaaaaatatgatacagatcaaattgtagggttaaatg 

ccctaatc^agttgtcaaaaaataaaaactttagttttataaatgcaacaagtgtgcccattatagataatattga 

aagcttaacaaatagcattagaaatacaagttctcttggtccttatagtacagatgctgtaaaaacagacgttagt 

ttgaagttggcaggttctgggcttatgtcaaaaagctgaagaagagcagtagtttattttagtggtggtattttaa 

atcgtaaagcttttgaaaagtactctttagatacaatagtaagctattataaaaataatgatataaggttttactt 

aatactatttggtaatgatcctattaatagtaagcttcagtatttagttaatgaaacaggcggtgctgtaattcct 

ttttcatcttatgaaggtgtatctaaagtttatgatttaattttagaacaaaaaacgggcacttatttgttggaat 

attattatccaggccctcaagaacctaataaatattttaatttatctgttgaagcaaatataaatcaacagacagg 

aagaggggagtttgcatattttattaattag 

t709.nt . 

C AAGGAATAGTTAC T AAT AAAGATGC TC AAGAAGAGTTT AAATGGGC TC TT AATTC TT ATAAT AATGG AATTT AC G 
ATGATGCTCTTTTATCTTTTAAAAAAATTTTAAGCTTTGATCCTAATAATCTTGATTATCATTTTTGGACTGGCAA 
TGTTTATT AT AGACTGGGTTATGTTGAAGAAGCTTT AATGG AATGGAGAAATTTAAAAGATCAAGGCTATAAGGTT 
CCCTATCTTAGACATTTGATTTCTACTATTGAGCAAAGGAGAGGTATTTTTTCAAATTATGAACTTAATTTTAAAA 
AACTTGTAAAAGTTGCTTCTCTTGATAATTCTATTTATAAAAGGCCACATGGGTACCAGATTACATCTTTAAGGGC 
TG AT AAGTACGGCGGATATT ACGCTGC TAAC TTTGT AGGC AATG AAAT ATTGT ATTTTGATGTTAAT AAC AATGTT 
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AATGCTTTAGTTAAAGATGGCTTTAGTTATTTAAAATCACCTTATGATGTTATTGAAGCTAATAATCTGCTTTATG 
TGACTCTTTATTCAAGTGATGAAATTGGTGTTTATGACAAAGTTCTTGGAGTTAAAAGGAAATCTATTGGGAATAA 
AGGCACAAAAGATGGCGAATTGCTTGCTCCTCAGTATATGGCTATTGATAAGAGAAACTATATTTATGTAAGTGAG 
TGGGGAAATAAAAGAGTAAGTAAATTTGGACTTGAAGGTGATTTTATTTTGCATTTTGGTTCTAGAACTTCAGGCT 
ATAAGGGCCTTTTAGGTCCCACAGGCGTTACTTATTTGAATGAAAACATTTATGTTGCAGATTCTCTGAGAAATAC 
CATTGAAGTTTTTGATACTAGTGGTAATCATTTATATTCAGTTTTTACTTCTATTGAGGGAATAGAGGGGCTTAGC 
AGTGATTTTGTAGGTAATAATGTTATAGTATCCTCAAAAGATGGTGTTTATAAATATAGCATTGCTAAAAAGACAA 
TTACAAAAATTTTAJ^AAGCAGATAAAATGAATTCTAAAATTTCTTCATCTATTTTGGATGCCAATAATCAGATGAT 
TGTCTCAGATTTTAATAATGCCAAGGTTTCAGTTTACAAGAGTGATGCAAGCCTTTATGATAGTTTAAATGTTGAT 
GTTAGAAGAATAATTAGGCTTGGAGGGCCTAAAATTTACGTTGAGCTTAATGTTAGCAGTAAAAGCGGATTACCAG 
TTGTTGGGCTTAAAAGTGAAAATTTTTCAATTTCAAATGAAAATTATTACATTGTCAATCCCAAGGTGGCATATAA 
TGTAAATGCTTCAAAAGACATTAATATAGCAGTTGTTTTTGATAAATCTTCTTATATGAAAAAATATGATACAGAT 
CAAATTGTAGGGTTAAATGCCCTAATGGAGTTGTCAAAAAJ^TAAAAACTTTAGTTTTATAAATGCAACAAGTGTGC 
CCATTATAGATAATATTGAAAGCTTAACAAATAGCATTAGAAATACAAGTTCTCTTGGTCCTTATAGTACAGATGC 
TGTAAAAACAGACGTTAGTT?TGAAGTTGGCAGGTTCTGGGCTTATGTCAAAAAGCTCAAGAAGAGCAGTAGTTTAT 
TTTAGTGGTGGTATTTTAAATCGTAAAGCTTTTGAAAAGTACTCTTTAGATACAATAGTAAGCTATTATAAAAATA 
ATGATATAAGGTTTTACTTAATACTATTTGGTAATGATCCTATTAATAGTAAGCTTCAGTATTTAGTTAATGAAAC 
AGGCGGTGCTGTAATTCCTTTTTCATCTTATGAAGGTGTATCTAAAGTTTATGATTTAATTTTAGAACAAAAAAGG 
GGCACTTATTTGTTGGAATATTATTATCCAGGCCCTCAAGAACCTAATAAATATTTTAATTTATCTGTTGAAGCAA 
AT AT AAATCAACAGACAGGAAGAGGGGAGTTTGCATATTTTATTAATTAG 

f730,aa ... 

' MIKSILDYLLTLHPVLLGLLGSTFTWFTTAFGAAAVFFFRKVDNKIMDAI4LGFSAGIMIAASFFSLIQPAIERAEE 
LGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLTFIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNP 
DIQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGNVALAKCFNYGQMSGLVEIVGGLMGAYAVYSFTRILPFALAFS 
AGAMI YVS I EQL I PEAKRKDIDNKVPS IFGVIGFTLMMFLDVSLGZ . 

t7 3 0.aa • 

AVFFFRKVDNKIMDAMLGFSAGIMIAASFFSLIQPAIEPAEELGYITWVPAVFGFLVGAFFIYIVDVFVPDLDKLT 
FIDEDLTKHGKKDFLLFTAVTLHNFPEGLAVGVAFGALASNPDIQTLVGAMLLTLGIGIQNIPEGAAISLPLRRGN 
VALAKCFNYGQMSGLVEIVGGLMGAYAVYSFTRILPFALAFSAGAMIYVSIEQLIPEAKRKDIDNKVPSIFGVIGF 
TLMMFLDVSLGZ 

f730.nt . . 

ATGATAAAATCAATTTTAGATTATTTATTAACTTTGCATCCTGTATTATTGGGACTTTTAGGTTCTACTTTCACTT 
GGTTTACTACAGCTTTTGGAGCAGCAGCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCT 
TGGTTTTTCAGCTGGCATTATGATAGCGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAG 
CTTGGATACATTACTTGGGTGCCGGCTGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTGTAGATGTAT 
TTGTTCCAGATCTGGATAAACTTACTTTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTT 
TACTGCTGTTACTTTACATAATTTTCCAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTGGCGTCTAATCCA 
GATATTCAAACTTTAGTTGGGGCTATGCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTA 
TTTCTCTGCCTTTAAGAAGAGGTAATGTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGA 
AATTGTGGGGGGGCTTATGGGTGCTTATGCGGTTTATTCTTTTACTCGAATTTTAGCTTTTGCTTTGGCTTTTTCT- 
GCAGGAGCTATGATTTATGTGTCAATTGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGC 
CAAGTATATTTGGTGTTATTGGTTTTACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 

t730.nt * 

GCAGTTTTTTTCTTTAGAAAGGTAGATAATAAAATAATGGACGCTATGCTTGGTTTTTCAGCTGGCATTATGATAG 
CGGCCAGTTTTTTTTCGCTTATTCAGCCTGCTATAGAAAGAGCTGAAGAGCTTGGATACATTACTTGGGTGCCGGC 
TGTTTTTGGATTTCTTGTTGGGGCATTTTTTATATATATTGTAGATGTATTTGTTCCAGATCTGGATAAACTTACT 
TTTATTGATGAAGACTTAACTAAACATGGTAAAAAAGATTTTTTACTCTTTACTGCTGTTACTTTACATAATTTTC 
CAGAAGGATTGGCTGTTGGAGTTGCTTTTGGAGCCTTGGCGTCTAATCCAGATATTCAAACTTTAGTTGGGGCTAT 
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GCTTCTTACGCTTGGTATTGGTATTCAAAATATTCCCGAAGGAGCAGCTATTTCTCTGCCTTTAAGAAGAGGTAAT 
GTTGCTTTGGCAAAATGCTTTAACTATGGCCAAATGTCAGGATTGGTAGAAATTGTGGGGGGGCTTATGGGTGCT-T 
ATGCGGTTTATTCTTTTACTCGAATTTTACCTTTTGCTTTGGCTTTTTCTGCAGGAGCTATGATTTATGTGTCAAT 
TGAACAATTAATACCTGAAGCTAAGAGAAAAGACATTGACAATAAAGTGCCAAGTATATTTGGTGTTATTGGTTTT 
ACATTAATGATGTTTCTCGATGTTTCACTAGGTTAA 



fl97.aa 

MLLKLKYRFVGFLLLFLTFILLLFSTIFNFVLCGYLEDYYKQLTRAQVRRAAFSLQSFLDTLHVIINGAASNIJ^LE 
TISEFAMSENRGKDFSESELIDLRKNPKFVIDSVKVSKKYRQYLYNFMANLKNDTLFEEFAFFDFEGRVIVSTRHE 
NNMDFGHSEANTOTFKKAVEDYRQNQLKFIGWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVC 
YLLOTIVADSFDRFRFGFYKRGNFIYVDPNNIAVNPFEEYNETSRVSSKFLNVLKDVFSKPPFPSNIASEVSVYT^ 
DRILLSEMGEDCYYAMLPISSKLGEKSGVLIARLPYKDIYGVISSLRFQYILYSVLGIIALSIVLSIRIDRIISFR 
'LNAIRVLVQDMVKGNLDKDYALDDDENTL 

SALQQASALEEMSANVEQIASGVNMSAJ^ISYETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEIARKTN 
LLALNAAIEAARAGDEGKGFAWASEIRKLADLSKISALEIGELVEDNSKVATEAGVIFKEMLPEIEETANLVKKI 
SEGSSKQSDQIAQFKMALDQVGEWQSSASSSEQLSSMSDKMLEKSKELRKSVLFFKIKDSKIENPENDDYDFRLI 
DCPENSFKDENQNLKSNGISTSNASGHNNYSLDIESESSVRTXNKRVDPKKAIDIADKDLNFDDDFSEF 

_tl97.aa ' ' 

VLCGYLEDYYKQLTRAQVRRAAFSLQSFLDTLHVIINGAASNLALETISEFAMSENRGKDFSESELIDLRKNPKFV 
IDSVKVSKKYRQYLYNFMANLKNDTLFEEFAFFDFEGRVIVST 

GWYSNLSEGISAEVAIRSKQSEKKAFAIIVPVYSPEDKLVCGYLAGYLLND'IVADSFDRFRFGFYKRGNFIYVDPN- 
NIAVNPFEEYNETSRVSSKFLNVLKDVFSKPPFPSNIASEVSVYTIDRILLSEMGEDCYYAMLPISSKLGEKSGVL 
IARLPYKDIYGVISSLRFQYILYSVLGIIALSIVLSIRIDRIISFRLNAIRVLVQDMVKGNLDKDYALDDDENTLD 
ELGMLSLQWKMKKAISVAISSVLRNISYWKASLEVASSSQN^ 

YETEQIALKTNENSQIGGRAVEESVIAMQDIVEKVSVIEEIARKTNLLALNAAIEAARAGDEGKGFAWASEIRKL 
ADLSKI S ALE IGELVEDNS KVATEAGVI FKEMLP EI EETANL VKKI S EGS S KQSDQ I AQFKMALDQVGE WQS S AS 
S S EQLS SMSDKMLEKSKELRKS VLF FKI KDS KI ENPENDDYDFRL I DC P ENS FKDENQNLKSNGI STSNASGHNNY 
-SLDIESESSVRTINKRVDPKKAIDIADKDLNFDDDFSEF 

fl97.nt . 

ATGTTATTGAAGCTTAAATACAGGTTTGTTGGATTTTTATTATTGTTTTTAATTTTTATACTGCTACTTTTTTCCA 
CGATTTTTAATTTTGTTTTATGCGGT?ATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGTAAGAAGAGC 
AGCTTTTTCTTTGCAATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTCTAATTTGGCACTTGAA 
ACCATATCAGAATTTGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAA 
ATCCAAAATTTGTTATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCT 
TAAAAATGATACCCTTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATGAG 
AATAATATGGATTTTGGTCATTCTGAGGCTAATACCAATTATTTTAAAAAAGCTGTTGA(^ATTATAGGCAAAACC 
AATTAAAATTTATAGGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCAGAAGTTGCTATTAGGTCTAAACAA 
CGAAAAAAAGGCTTTTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTGTGGGTATTTGGCCGGA 
TATTTGCTTAATGATATTGTGGCAGATAGTTTTGATAGATTTAGATTCGGTTTTTATAAAAGAGGCAATTTTATTT 
ATGTGGATCCCAACAATATAGCAGTTAATCCTTTTGAAGAATATAATGAAACCAGCAGGGTTAGTTCTAAATTTTT 
GAATGTTCTTAAAGATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTGAAGTGTCGGTTTACACTATT 
GATAGAATACTTTTGTCCGAAATGGGAGAAGATTGTTATTATGCAATGTTGCCCATAAGTAGTAAATTGGGAGAAA 
AGAGTGGAGTACTTATTGCTAGGCTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATTTCAGTATAT 
TTTATATTCAGTCTTAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATTATTAGTTTTCGT. 
TTAAACGCAATTAGAGTTCTAGTTCAAGATATGGTTAAGGGCAATTTAGATAAAGATTATGCTCTTGATGATGATG 
-AAAATACTCTTGATGAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATGAAAAAAGCTATTTCTGTAGCAATTTC 
TAGTGTTTTGAGAAATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTTAAGCTCT 
AGTGCATTGCAACAGGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGCAAATAGCCTCAGGTGTCAACATGA 
GCGCCAATAATTCTTATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGTGG 
TGAA.GAATCTGTTATTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGCTAGAAAAACCAAT 
TTACTTGCTTTGAATGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTTGCTGTTGTGGCCAGTG 
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AGATTAGAAAGTTGGCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGT 
AGCAACTGAAGCGGGAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGATT 
TCAGAAGGTAGCTCTAAGGAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTC 
AATCTTCAGCTTCAAGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAACTTAGAAA 
ATCTGTATTATTTTTCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATA 
GATTGTCCTGAAAATTCTTTTAAAGATGAAAATCAAAATTTGAAAAGCAATGGAATTTCTACTTCAAATGCCAGTG 
GGCATAATAATTATTCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAA 
AGCTATCGATATTGCTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

tl97.nt " . 

GTTTTATGCGGTTATTTAGAAGATTATTATAAGCAGCTTACAAGGGCGCAAGTAAGAAGAGCAGCTTTTTCTTTGC 
AATCTTTTTTAGACACCCTGCATGTCATAATCAATGGTGCAGCTTCTAATTTGGCACTTGAAACCATATCAGAATT 
TGCAATGTCTGAGAATAGAGGAAAAGATTTCTCTGAGTCGGAATTGATAGATTTAAGAAAAAATCCAAAATTTGTT 
ATTGACTCTGTAAAGGTGAGCAAAAAATATCGACAATACTTATACAATTTTATGGCCAATCTTAAAAATGATACCC 
TTTTTGAAGAATTCGCTTTTTTTGATTTTGAAGGGAGAGTAATTGTTAGCACAAGACATGAGAATAATATGGATTT 
TGGTCATTCTGAGGCTAATACCAATTATTTTAAAAAAGCTGTTGAGGATTATAGGCAAAACCAATTAAAATTTATA 
GGTTGGTATTCAAATCTTTCTGAAGGAATATCCGCAGAAGTTGCTATTAGGTCTAAACAAAGCGAAAAAAAGGCTT 
TTGCAATAATTGTACCTGTATATTCCCCAGAAGATAAACTTGTTTGTGGGTATTTGGCCGGATATTTGCTTAATGA 
TATTGTGGCAGATAGTTTTGATAGATTTAGATTCGGTTTTTATAAAAGAGGCAATTTTATTTATGTGGATCCCAAC 
AATATAGCAGTTAATCCTTTTGAAGAATATAATGAAACCAGCAGGGTTAGTTCTAAATTTTTGAATGTTCTTAAAG 
ATGTTTTCTCTAAGCCCCCTTTTCCATCAAACATTGCCAGTGAAGTGTCGGTTTACACTATTGATAGAATACTTTT 
GTCCGAAATGGGAGAAGATTGTTATTATGCAATGTTGCCCATAAGTAGTAAATTGGGAGAAAAGAGTGGAGTACTT 
ATTGCTAGGCTTCCTTATAAGGATATTTACGGAGTAATATCTAGTCTAAGATTTCAGTATATTTTATATTCAGTCT 
TAGGCATTATAGCATTAAGTATTGTTCTTTCAATTAGAATAGACAGGATTATTAGTTTTCGTTTAAACGCAATTAG 
AGTTCTAGTTCAAGATATGGTTAAGGGCAATTTAGATAAAGATTATGCTCTTGATGATGATGAAAATACTCTTGAT 
GAACTTGGCATGTTAAGTCTTCAGGTTGTTAAAATGAAAAAAGCTATTTCTGTAGCAATTTCTAGTGTTTTGAGAA 
ATATTAGCTATGTAAATAAGGCAAGTTTAGAAGTTGCCAGTTCAAGTCAAAATTTAAGCTCTAGTGCATTGCAACA 
GGCATCTGCTCTTGAAGAAATGTCAGCTAATGTTGAGCAAATAGCCTCAGGTGTCAACATGAGCGCCAATAATTCT 
TATGAAACAGAACAAATAGCTTTAAAGACGAATGAAAATTCTCAGATAGGTGGTAGGGCCGTTGAAGAATCTGTTA 
TTGCTATGCAAGACATTGTGGAGAAAGTTAGTGTTATTGAAGAGATAGCTAGAAAAACCAATTTACTTGCTTTGAA 
TGCGGCTATTGAAGCTGCAAGAGCAGGAGATGAGGGAAAGGGATTTGCTGTTGTGGCCAGTGAGATTAGAAAGTTG 
GCTGATTTGAGTAAAATTTCTGCTCTTGAGATTGGAGAGTTAGTTGAAGATAACTCTAAGGTAGCAACTGAAGCGG 
GAGTGATCTTTAAAGAAATGCTACCCGAAATTGAAGAAACGGCTAATCTTGTTAAGAAGATTTCAGAAGGTAGCTC 
TAAGCAAAGCGATCAGATTGCTCAATTTAAAATGGCTTTAGATCAGGTTGGAGAAGTTGTTGAATCTTCAGCTTCA 
AGCAGTGAGCAGCTTTCTAGTATGTCCGATAAAATGTTAGAAAAGTCTAAGGAAC.TTAGAAAATCTGTATTATTTT 
TCAAAATTAAAGATTCTAAAATTGAAAATCCAGAAAATGATGATTATGATTTCAGGTTAATAGATTGTCCTGAAAA 
TTCTTTT AAAG ATGAAAATC AAAATTTGAAAAGC AATGGAATTTCT AC TTC AAATGC C AGTGGGC AT AAT AATT AT 
TCTTTAGATATTGAGAGCGAATCTTCTGTAAGAACTATTAATAAGCGAGTTGATCCTAAAAAAGCTATCGATATTG 
CTGATAAGGATTTAAATTTTGATGATGATTTTTCAGAGTTTTAG 

f200.aa 

MTISKNVFSKFILKFLNSSAFVSVFALFVGFLIVGLVVMGLGHSPFRMYFIILEIIFSSPKHLGYVLSYSAPLIFT 

GLSIGISLKAGLFNIGVEGQFILGSIVALIASVtiXiDLPPILHVITIFIITFLASGSLGILIGYLKAKFNISEVISG - 

IMFNWILFHLNNIILDFSFXKRDNSDFSKPIKESAYIDFLASWKLSPEGLAYRSSHPFVNELLKAPLHFGIILGII 

FAILIWFLLNKTIIGFKINATGSNIEASRCMGINVKAVLIFSMFLSAAVAGLAGAIQLMGVNKAIFKLSYMQGIGF 

NGIAASLMGNNSPIGIIFSSILFSILLYGSSRVQSLMGLPSSIVSLMMGIIVLVISASYFLNKIVLKGVKRVKYNN 

ILD 



t200.aa 

LVVMGLGHSPFRMYFIILEIIFSSPKHLGYVLSYSAPLIFTGLSIGISLXAGLFNIGVEGQFILGSIVALIASVXjL 
DLPPILHVITIFIITFtJ^SGSLGILIGYLKAKFNISEVISGIMFNWILFKLNNIILDFSFIKRDNSDFSKPIKESA 
YIDFLASWKLSPEGLAYRSSHPFVNELLKAPLHFGIILGIIFAILIWFLLNKTIIGFKINATGSNIEASRCMGINV 



Ill 

TABLE L Nucleotide and Amino Acid Sequences 

KAVLIFSMFLSAAVAGLAGAIQLMGVNKAIFKLSYMQGIGFNGIAASLMGNNSPIGIIFSSILFSILLYGSSRVQS 
LMGLPSSIVSLMMGIIVLVISASYFLNKIVLKGVKRVKYNNILD 

£200. nt ■ 

ATGACAATTAGTAAAAACGTATTTAGTAAATTTATTTTGAAATTTTTAAATTCTTCAGCATTTGTTAGTGTATTTG 

CTCTATTTGTTGGATTTTTAATTGTTGGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTAT 

AATATTAGAAATTATTTTTTCTTCTCCCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTCCTTTGATTTTTACA 

GGTCTTTCTATTGGTATTTCTTTAAAAGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTA 

TTGTTGCTTTAATAGCATCAGTTTTACTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTT 

TTTAGCTTCAGGCAGTTTAGGAATTTTAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGA 

ATAATGTTTAATTGGATATTATTTCATTTAAATAATATAATTTTAGATTTTAGTTTTATTAAAAGAGATAATAGTG 

ATTTTTCAAAACCCATTAAAGAAAGCGCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGC 

TTATAGATCTTCTCATCCTTTTGTTAATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATT 

TTTGCTATTTTAATATGGTTTTTACTTAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTG 

AAGCTTCAAGATGTATGGGTATTAATGTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGG ' 

TCTTGCTGGTGCTATTCAACTTATGGGTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTT 

AATGGGATAGCTGCTTCTCTTATGGGAAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATAT 

TGCTTTATGGAAGCAGTAGAGTTCAAAGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAAT 

TGTTCTTGTAATTTCTGCTAGCTATTTTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAATATAATAAT 

ATTCTTGATTAG 

.t200.nt - " ■ * " 

GGGCTAGTGGTGATGGGGCTTGGTCATTCTCCTTTTAGAATGTATTTTATAATATTAGAAATTATTTTTTCTTCTC 
CCAAACATTTAGGTTATGTTTTAAGTTATTCAGCTGCTTTGATTTTTACAGGTCTTTCTATTGGTATTTCTTTAAA 
AGCGGGTCTTTTTAATATTGGGGTTGAAGGCCAGTTTATACTAGGATCTATTGTTGCTTTAATAGCATCAGTTTTA 
CTTGATTTGCCTCCAATTTTACATGTAATTACTATTTTTATTATTACTTTTTTAGCTTCAGGCAGTTTAGGAATTT 
TAATCGGATATTTAAAAGCCAAATTCAATATTAGCGAAGTGATTTCAGGAATAATGTTTAATTGGATATTATTTCA 
TTTAAATAATATAATTTTAGATTTTAGTTTTATTAAAAGAGATAATAGTGATTTTTCAAAACCCATTAAAGAAAGC • 
GCATATATTGATTTTTTAGCTTCTTGGAAGCTCTCACCAGAAGGTCTTGCTTATAGATCTTCTCATCCTTTTGTTA 
ATGAGCTTTTAAAAGCACCTCTTCATTTTGGAATAATTTTAGGTATAATTTTTGCTATTTTAATATGGTTTTTACT 
. TAATAAAACTATTATTGGATTTAAAATAAATGCCACAGGAAGTAATATTGAAGCTTCAAGATGTATGGGTATTAAT 
GTAAAAGCTGTGCTAATTTTTTCAATGTTTCTCTCAGCAGCTGTTGCAGGTCTTGCTGGTGCTATTCAACTTATGG 
GTGTTAATAAAGCTATATTTAAGCTTTCTTATATGCAAGGAATTGGTTTTAATGGGATAGCTGCTTCTCTTATGGG 
AAACAATTCGCCAATTGGCATAATATTTTCTAGCATTCTTTTTTCTATATTGCTTTATGGAAGCAGTAGAGTTCAA 
AGTTTAATGGGCCTTCCATCTTCAATTGTATCTTTGATGATGGGAATAATTGTTCTTGTAATTT.CTGCTAGCTATT 
TTTTAAATAAAATTGTTTTAAAAGGTGTTAAGCGTGTCAAAT AT AATAAT ATTCTTGATTAG 

f 208 . aa 

MVKKFSIFLKAIIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVRNNLF 
FDIIHCLIPLAFYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEKNLILIAFARTFSMSL 
LIPFTFFIIISSSKIVlSrSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVI 
EMEKTKFYIDKYLQRKSDSILGIFLFTLFASFTIFLMNFYKFFKASFLNPIILMTKILQDPLEYRKIQIPFTLSEE 
KVYELAKSFNNLLLKEKLNSKRKSKIPLEIEKVKKIINKNQEIK 

t208.aa 

IIIFSIFELLIEELSIILFLPYKIRFALIFLGFLFDTIFIFIFLYKITKAYLSQRLEIYVRNNLFFDIIHCLIPLA ■ 
FYSSYQLKNIIVAHETILNPIMLSLFKLRFLRLLRFNDLIIEIYYNSKEKNLILIAFARTFSMSLLIPFTFFIIIS 
SSKIVNSIPEKQEFNIIKNISIINEKAYIKEKYPFILIIKEKDDIIYSKSDEIFVYYSPSEYRVIEMSKTKFYIDK 
YLQRKSDSILGIFLFTLFASFTIFLMNFYKFFKASFLNPIILMTKILQDPLEYRKIQIPFTLSEEKVYELAKSFNN " 
LLLKEKLNSKRKSKIPLEIEKVKKIINKNQEIK 



£208 .nt 
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ATGGTAAAAAAATTTTCAATTTTCTTAAAAGCAATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCT 

CAATAATTCTTTTTTTACCATACAAAATACGATTTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTAT 

TTTCATTTTTTTATACAAAATAACCAAGGCCTACCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTC 

TTCGATATAATCCACTGCCTTATTCCTTTAGCGTTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCGCATG 

AAACAATATTAAATCCAATAATGCTATCACTTTTCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAAT 

AATAGAAATATATTACAATTCAAAAGAAAAGAACCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTA 

TTAATACCATTTACATTTTTTATAATAATATCAAGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTA 

ATATCATTAAAAATATATCAATAATAAATGAAAAAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAA 

GGAAAAAGATGACATAATATACTCAAAATCAGACGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATA 

GAAATGGAGAAAACAAAATTTTATATAGATAAATATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTAT 

TTACATTGTTTGCATCATTTACTATTTTTTTAATGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTAT 

TATTTTAATGACAAAAATTTTACAAGACCCATTAGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAA 

AAAGTATATGAACTTGCAAAATCATTTAACAATCTCTTGCTAAAAGAAAAACTA^ 

T AC C TTTAGAAATTGAAAAAGTAAAAAAAATAATT AAT AAAAACCAGGAAAT AAAATGA 

t208.nt 

ATAATAATTTTTTCAATATTTGAACTTTTAATCGAAGAACTCTCAATAATTCTTTTTTTACCATACAAAATACGAT 

TTGCACTAATATTTCTTGGGTTTCTATTTGACACAATTTTTATTTTCATTTTTTTATACAAAATAACCAAGGCCTA 

CCTTTCCCAAAGATTAGAAATCTACGTCAGAAACAATCTATTCTTCGATATAATCCACTGCCTTATTCCTTTAGCG 

TTTTATAGCTCATATCAGCTTAAAAACATAATTGTCGCCCATGAAACAATATTAAATCCAATAATGCTATCACTTT 

TCAAGTTAAGATTTTTAAGACTTCTTAGGTTTAATGACCTAATAATAGAAATATATTACAATTCAAAAGAAAAGAA 

CCTAATACTAATAGCATTTGCTAGGACATTTTCAATGAGCTTATTAATACCATTTACATTTTTTATAATAATATCA 

AGCTCAAAAATTGTAAATTCAATACCAGAAAAACAAGAATTTAATATCATTAAAAATATATCAATAATAAATGAAA 

AAGCTTACATTAAAGAAAAATATCCCTTCATCTTAATAATCAAGGAAAAAGATGACATAATATACTCAAAATCAGA 

CGAAATATTTGTTTACTACAGTCCCAGTGAATATAGAGTAATAGAAATGGAGAAAACAAAATTTTATATAGATAAA 

TATTTGCAAAGAAAAAGCGATTCTATTCTTGGAATTTTTCTATTTACATTGTTTGCATCATTTACTATTTTTTTAA 

TGAATTTTTATAAATTTTTTAAAGCAAGCTTTTTAAATCCTATTATTTTAATGACAAAAATTTTACAAGACCCATT 

AGAATATCGAAAAATTCAAATTCCTTTTACTTTAAGCGAAGAAAAAGTATATGAACTTGCAAAATCATTTAACAAT 

CTCTTGCTAAAAGAAAAACTAAACTCAAAGCGAAAAAGCAAAATACCTTTAGAAATTGAA^ 

TTAAT AAAAACCAGGAAAT AAAATGA 



f2l0.aa 

MKIQIIIMLLALLDFPLNARLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEIEKNIYKLTEHFVKSIMLNK 
T^SLLNSNYKEANKYLIQSELIDKKFLKYKIFKIKNINGIFKSHSLIYTKKGFYKLELYIE^AEPLKIFNLNIT 
YFLKNLDKISNEMIFFFREKREVNMIQ 

IDTAADGEEAVIKYKNHYPNIDIVTLDITMPKMDGITCLSNXM 

KPLDRAKVLQRVMSVFVK. ■ 



t210.aa ■ * 

RLLDISIEKRADEEIKKYSSYNLILEKEYYTNFPTSEIEKNIYKLTEHFVKSIMLNKTNYSLLNS^^^KEANKYLIQ 
SELIDKKFLKYKIFKIKNINGIFKSHSLIYTKKGFYKLELYIENNAEPLKIFNLNITYFLKNLDKISNEMIFFPRE 

KREVNMIQKTTIAADSSSKPRGINYDTGIPF^^ 
NIDIVTLDITMPKMDGITCLSNIMEFDKNAR 

£210. nt 

ATGAAAATTCAAATAATTATAATGCTGCTTGCATTGTTAGATTTTCCACTTAATGCCAGACTTTTGGACATTTCAA 
TTGAAAAAAGAGCAGATGAAGAAATAAAAAJ^-TATTCGTCTTATAATTTAAT^ 

TTTTCC AAC AAGCGAAATAGAAAAAAAT ATTT AT AAAC TAAC AGAAC ATTTTGTAAAAAGC AT AATGC TC AAT AAA 
AC T AAC T AC AGCTT ATTAAATTC AAAC T AC AAAGAAGC AAJVT AAAT ATCT AATTC AJ^-AGCG AAC TC ATTGAT AAAA 
AATTTTTAAAJVTATAAAJ^TATTTAAAATCAAAAATATAAATGGAJ^TTTTTAAAAGC C ATTC AC T AAT AT AT AC AAA 
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AAAAGGATTTT AC AAATT AGAAC TTT AC AT AG AAAATAATGC AGAAC C TC TAAAAATATTT AAC C TTAAC ATTACT 
TATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAATGA 

t210.nt 

AGACTTTTGGACATTTCAATTGAAAAAAGAGCAGATGAAGAAATAAAAAAATATTCGTCTTATAATTTAATTTTAG 
AAAAAGAATACTATACCAATTTTCCAACAAGCGAAATAGAAAAAAATATTTATAAACTAACAGAACATTTTGTAAA 
AAGCATAATGCTCAATAAAACTAACTACAGCTTATTAAATTCAAACTACAi^GAAGCAAATAAATATCTAATTCAA 
AGCGAACTCATTGATAAAAAATTTTTAAAATATAAAATATT.TAAAATCAAAAATATAAATGGAATTTTTAAAAGCC 
ATTC AC TAAT AT ATACAAAAAAAGGATTTT AC AAATT AG AAC TTT AC AT AG AAAATAATGC AGAAC CTCTAAAAAT 
ATTTAACCTTAACATTACTTATTTTTTAAAGAATTTAGATAAAATAAGTAATGAAATGATTTTTTTCCCAAGGGAA 
TGA 

f22.aa . 

MLKTLTKIITISCLIVGCASLPYTPPKQNLOTLMELLPGANLYAHVNLIKNRSIYNSLSPKYKSVLGLISNLYFSY 
KKENNDFALLIMGNFPKDIFWGIHKNRNTESIGNIFTNPKW 

KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 
TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT 

t22 . aa • 
PYTPPKQNLNYLMELLPGANLYAHWLIKNRSIYNSLSPKYKSVLGLISNLYFSYKKENN 

" GIHKNRKTES IGNI FTNPKWKLKNSNI YI I PNKARTS I AITQKDITAKDNNMLTTKYIGEI EKNEMFFWIQDPTLD 
LPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIKDQNT 
VEIEFNIQKSSVESLIEKLASNIQT 

.f22.nt .•'.*. ^ 

ATGTTAAAAACATTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 

CAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAAA^ 

CAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTTTAGCTAT 

AAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAAGATATTTTCTGGGGAATT 

ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 

TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAATATGCTAACAACA 

AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTATTGCTCCCAAACCA 

TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 

TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAJy^TATTGTCAAAAAAGTTAATTCCAAC 

ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 

TTAAT ATTC AAAAATCTAGTGTTGAAAGCCTT AT AGAAAAACTAGCTTCAAAT ATTC AAAC CTAA 

t.22 .nt • 

ccttacactcctccaaaacaaaatctaaattacttaatggaacttttacctggcgcaaatttatacgcccatgtaa 
atttaattaaaaacaggtctatttataactctttaagccctaaatataaatcagttcttgggcttataagcaattt 
atactttagctataaaaaagaaaataacgattttgctctactaataatgggtaatttcccaaaagatattttctgg 
ggaattcataaaaatagaaatacagaatcaataggcaatatatttacaaat^ 

atatatacattattccaaacaaagctagaactagcattgcaataacccaaaaagatataaccgcaaaagacaataa 
tatgctaacaacaaaatatattggggaaatagaaaaaaatgaaatgtttttttggattcaagatccaacattattg 
ctcccaaaccaaatagtaagcagcaaaaatttaattccctttagcagtggaactttgtctataaacagcttaa-atc 
aagaagaatatatttttaaatccttaatcaaaacaaataatcca.ccaatactaaaaatattgtcaaaaa^ 

TC C AAC CGTC TTGAC AAAC ATGAC AAAC C TC AC AATATC AAGC C AC AT AAAG ACC AC AATAAAAGACC AAAAT ACG 
GTTGAAATAGAATTTAATATTCAAAAATCTAGTGTTGAAAGCCTT AT AGAAAAACTAGCTTCAAAT ATTCAAACCT 

AA ' • 



f 221 .aa 
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MGITVFYLFSIFASFVLGSSMDSVKENVLKSTIFYYDVEEVEFPYARKQTLQFIAKTHLKYAVFNFDKNKMFSYTF 
VFDKKL I SQ YAI F I EVKKKFGEATLVTP LNYLWDLGDS I IVLNKNI LRITLKS YI SNYNK ■ ■ 

t221.aa 

SMDS VKENVLKST I F YYDVEEVEF P YARKQTLQF I AKTHLKYAVFNFDKNKMF SYTF VFDKKL I SQYAI F I EVKKK 
FGEATLVTPLNYLWDLGDS I I VLNKNI LRITLKS Y I SNYNK 

f221.nt 

ATGGGTATTACAGTTTTTTATTTATTTTCTATTTTTGCATCTTTTGTTCTGGGTTCTAGCATGGATTCTGTTAAAG- 

AGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTTATGCTAGGAAGCAGACTTT 

ACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAAAATGTTTTCGTACA 

GTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAGTTTGGCGAGGCTACACTAG 

TAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAAATATTTTAAGAATTACTTT 

AAAATCTTATATTTCAAATTATAATAAATGA 

t221.nt 

' AGCATGGATTGTGTTAAAGAGAATGTTCTCAAGAGCACTATTTTTTATTATGATGTTGAAGAAGTTGAATTTCCTT 
ATGCTAGGAAGCAGACTTTACAATTTATTGCTAAAACCCATTTAAAATATGCTGTTTTTAATTTTGACAAAAATAA 
AATGTTTTCGTACACTTTTGTTTTTGATAAAAAATTAATATCTCAGTATGCAATTTTTATTGAGGTAAAGAAAAAG 
TTTGGCGAGGCTACACTAGTAACGCCTTTGAATTATTTATGGGATCTTGGTGATTCTATTATTGTTTTAAATAAAA 
ATATTTTAAGAATTACTTTAAAATCTTATATTTCAAATTATAATAAATGA . 

"f253*.aa • 

MYMENIEVRGQPNFFGLIPFFVFIIIYLGTGIYLGVIGVEMAFYQLPASVAMFFASIVCFLVFKGKFSDKIHIFIK 
GAAQYDIILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFSAGTSVGSIVAIAPIAF 
NIAVKSGINPNLIAASVMCGAMFGDMLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSAILTFFSFFFLSENLSN 
ATNFLHESSIDLVKTVPYLMIIFFSLAGMNVFIVLFLGILSICLISVLYGNLYFL 

LTGGVSFAVIHNGGFKWLLIKLKSLIRGKSSAEFSIGAFVSIVDVFLANNTIAILICGKVAKKIAFENNISVQRSA 
SILDMFSCIFQGIIPYGAQMIILVNFSNGLVSPISILPFLVYFGFLLFFVILSILGLDIKKVFLFFLKK 

t253 .aa •* * 

LVFKGKFSDKIHIFIKGAAQYDI ILMCLIFMLSGAFSSLCKEIGCVETVANLGIKYINPNWIVSGIFFVTCFLSFS- 
AGTSVGSIVAIAPIAFNIAVKSGINPNLIAASVMCGAMFGDNLSLISDTTIVSSRTQGSSILDVFISSSFYAFPSA 
ILTFFSFFFLSENLSNATNFLHESSIDLVKTVPYLMIIFFSLAGMNVFIVLFLGILSICLISVLYGNLYFLDVMKN 

INKGFLNMADLIFLSILTGGVSFAVIHNGGFKWLLIK^^ 

AKKI AFENNI S VQRSAS I LDMF SC I FQG 1 1 P YGAQMI I L VNF SNGLVS PIS I LPF LVYTFGF LLFFVI LS I LGLDI K 
KVFLFFLKK ; . . 

f253 .nt ' • 

ATGTATATGGAAAATATTGAAGTAAGAGGGCAGCCAAATTTTTTTGGGCTTATTCCTTTTTTTGTTTTTATTATTA 
TCTATTTAGGCACGGGGATTTATTTGGGAGTTATTGGTGTAGAAATGGCCTTTTATCAACTGCCGGCTAGTGTTGC 
AATGTTTTTTGCTTCCATTGTTTGTTTTTTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAA 
GGAGCAGCTCAGTACGATATTATACTAATGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAG 
AAATAGGCTGCGTTGAAAGTGTAGCAAATTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTGGTATATT 
TTTTGTAACCTGCTTTCTTTCTTTTTCTGCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTT 
AATATTGCTGTTAAAAGCGGCATTAATCCGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATA 
ATCTTTCTTTAATATCAGATACAACTATTGTTTCTAGTCGAACTCAAGGTAGTAGCATCTTAGATGTTTTTATTAG 
TAGCAGTTTTTATGCTTTTCCATCCGCCATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTGTCCAAT 
GCCACAAACTTTTTACACGAAAGTTCAATAGATTTAGTGAAAACTGTGCCTTATTTAATGATTATATTTTTCTCTT 
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TAGCTGGAATGAATGTTTTTATAGTTCTTTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAA 
TTTATACTTTCTAGATGTAATGAAAAACATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATT 
TTAACAGGGGGAGTTTCTTTTGCCGTGATTCATAATGGAGGCTTTAAATGGCTACTTATTAAATTAAAATCCTTGA 
TTAGAGGAAAAAGTTCAGCGGAATTTTCTATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACAC 
AATTGCCATACTTATTTGCGGCAAAGTAGCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCT 
TCTATTTTAGATATGTTCTCTTGTATTTTTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATT 
TTTCAAATGGACTTGTGTCGCCAATTAGTATTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTAT 
TTTATCTATTTTGGGCCTTGATATAAAAAAAGTTTTTTTATTTTTTTTAAAAAAATAA 

t253 .nt 

TTGGTATTTAAAGGAAAATTTTCCGACAAAATTCACATATTTATTAAAGGAGCAGCTCAGTACGATATTATACTAA 
TGTGTCTTATTTTTATGCTTTCGGGAGCTTTCTCTTCTCTTTGTAAAGAAATAGGCTGCGTTGAAACTGTAGCAAA 
TTTGGGAATTAAATATATTAATCCTAATTGGATTGTTTCTGGTATATTTTTTGTAACCTGCTTTCTTTCTTTTTCT 
GCCGGCACTTCTGTTGGATCTATCGTTGCAATTGCTCCTATTGCTTTTAATATTGCTGTTAAAAGCGGCATTAATC 
CGAATTTAATAGCAGCATCTGTAATGTGTGGAGCTATGTTTGGAGATAATCTTTCTTTAATATCAGATACAACTAT 
TGTTTCTAGTCGAACTCAAGGTAGTA.GCATCTTAGATGTTTTTATTAGTAGCAGTTTTTATGCTTTTGCATGCGCC 
ATACTAACTTTTTTTTCTTTTTTCTTTCTTTCTGAAAATTTGTCCAATGCCACAAACTTTTTACACGAAAGTTCAA 
TAGATTTAGTGAAAACTGTGCCTTATTTAATGATTATATTTTTCTCTTTAGCTGGAATGAATGTTTTTATAGTTCT 
TTTTTTAGGTATTCTTTCTATATGTCTTATTAGCGTTTTGTATGGTAATTTATACTTTCTAGATGTAATGAAAAAC 
ATTAATAAAGGGTTTTTAAATATGGCGGATTTGATTTTTCTTTCAATTTTAACAGGGGGAGTTTCTTTTGCCGTGA 
TTC AT AATGGAGGC TTT AAATGGC T AC TTATT AAATT AAAATC C TTG ATT AG AGGAAAAAGTTC AGC GGAATTTTC 
TATTGGGGCTTTTGTTTCAATAGTTGATGTTTTTCTTGCTAATAACACAATTGCCATACTTATTTGCGGCAAAGTA 
GCAAAAAAGATAGCTTTTGAAAATAACATCAGTGTTCAAAGAAGTGCTTCTATTTTAGATATGTTCTCTTGTATTT 
TTCAAGGCATTATTCCTTATGGTGCGCAAATGATTATTTTAGTGAATTTTTCAAATGGACTTGTGTCGCCAATTAG 
TXTTTTGCCATTTTTAGTTTATTTTGGATTTTTATTGTTTTTTGTTATTTTATCTATTTTGGGCCTTGATATAAAA 
AAAGTTTTTTTATTTTTTTTAAAAAAATAA ■ 

£265. aa^ 

MRKCFVSLSLLLIFFACSSNVEIELNDDISGIVSIFVNVNREFEKIRKELLTTLVGEEIANMPLFPVDEIKKYFKN 

GEEKLGLKLLSIKTQGDSINLWKFDNLIKILGDYMKKPDISVFKIEKKDGKNIIELNINLENATKNINENKEYIS 

DALAALLPSDEIPMSAKEYKDVLVY^ 

LS LET P I KLRLV 

Y * 

t265.aa 

SOTEIELNDDISGIVSIFVNVNREFEKIRKELLTTL^^ 

INLVVTCFDNLIKILGDYMKKPDISVFKIEKK^^ 

YKDVTjVYFLSDFTSKASELIDNSKLNLVVKTSRIW 

f265.nt ' ' . 

ATGAGAAAGTGTTTTGTTAGCTTGAGTTTATTGTTGATTTTTTTTGCTTGTAGCTCTAATGTTGAAATTGAGTTAA 

ATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTGAAAAAATTAGAAAAGAACTCTT 

AACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTGTTTTTCCTGTAGATGAAATAAAAAAAT 

GGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCTATTAATTTAGTTGTTAAGTTTG 

ATAATTTAATT AAAATTTT AGGCG ATT AT ATGAAAAAAC C C GAT AT ATC TGTGTTT AAGAT AGAAAAAAAAG ATGG 

TAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAATATTAATC 

GATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAATATAAAGATGTTTTGGTTTATT 
TTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTAATCTTGTAGTTAAGACTTCTAG 
AAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACACACTGCGGTTTGAGATGGA.TATGGTTAAAGGA 
TTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 



t265 .nt 
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• TCTAATGTTGAAATTGAGTTAAATGATGATATTAGTGGTATTGTTTCAATATTTGTTAATGTTAATAGAGAATTTG 
AAAAAATTAGAAAAGAACTCTTAACAACTTTGGTGGGAGAAGAAATTGCAAATATGCCTCTTTTTCCTGTAGATGA 
AATAAAAAAATACTTTAAAAATGGAGAGGAAAAGCTTGGGCTTAAGCTTTTGAGTATTAAAACCCAAGGAGATTCT 
ATTAATTTAGTTGTTAAGTTTGATAATTTAATTAAAATTTTAGGCGATTATATGAAAAAACCCGATATATCTGTGT 
TTAAGATAGAAAAAAAAGATGGTAAAAATATTATTGAACTTAATATTAATTTGGAAAACGCTACTAAGAATATTAA 
TGAAAATAAAGAATATATTAGTGATGCACTTGCTGCTCTTTTGCCATCGGATGAGATCCCAATGTCTGCCAAAGAA 
TATAAAGATGTTTTGGTTTATTTTTTATCGGATTTTACTTCCAAAGCAAGTGAACTTATTGACAATTCCAAACTTA 
ATCTTGTAGTTAAGACTTCTAGAAATGTTCAAGAACAATTTGGATTCAAACAAATTAACTCAAACAGACTGCGGTT 
TGAGATGGATATGGTTAAAGGATTAAGTCTTGAAACACCAATAAAACTTAGATTAGTTTATTGA 

f269.aa 

MNIRKLLFCIFFMNISFLLFAGDYKGLDFKIKFFNQSIYRVNSNVFIEVSLSNASESVLTLEIGDINSFGFDFDVT 
DTTNIKVKRPIEYVKKRSKIWAIPVRN^ 

TLFLNDGFDENPGSIDLVNLSENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLY 
KQKLSPIPNKNWEEYKEYLWNSNNSDTSI^^ 

WIIYDYIVQNTGTKEK " - ' 

t2 6 9.aa . 

GDYKGLDFKIKFFNQSIYRVNSWFIEVSLSNASESVLTLEIGDINSFGFDFDVTDTTNIKVKRPIEWKKRSKW 
AIPVRNMSLRPNEKFSWINLNQFVKFSKDGVYFVKGIFFPDISDPSKKKESNIITLFLNDGFDENPGSIDLVNLS 
ENNDIQDILKKKKLSPDEIVKYLLKALQLGKKEKFFLYLDIEGLLLNDKGKAYLYKQKLSPIPNKNWEEYKEYLW 
NSNNSDISKAPNKFSIIETTYSDTSGKyiADLYFDDGQFYISKRYTFFFKKYDYYWIIYDYIVQNTGIKEK 

f269.nt - 

< 

ATGAATATTAGAAAATTGCTTTTTTGTATCTTTTTTATGAATATTTCTTTTCTTTTGTTTGCGGGAGATTACAAGG 
GCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATGGTGTCAATAGTAATGTTTTTATTGAAGTTTCTCT 
TAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGATTTTGATGTTACT 
GATACCACCAATATTAAAGTTAAAAGACCTATTGAATATGTTAAAAAGAGATCTAAAAATGTTGCAATTCCTGTTA 
GAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAACTTAAATCAATTTGTTAAGTTTAGTAAAGA 
TGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGAATCCAATATTATT 
ACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCTGAAAATAATGATA 
TTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGGCATTGCAGCTTGG 
GAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACAAGGGCAAGGCATACCTTTAT 
AAGC AAAAGTT ATC AC CT ATTC C C AAT AAAAATGTAGTTGAAGAGTAT AAAGAATATTTGTGGAATTC T AAT AATT 
CGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATACTTCTGGCAAGGTGATTGC 
TGATTTATATTTTGACGATGGGCAATTTTATATTTCCAAAAGATATACTTTCTTCTTTAAAAAATATGATTATTAT 
TGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 

t269.nt . " , 

GGAGATTACAAGGGCCTTGATTTTAAAATCAAGTTTTTTAATCAATCTATTTATCGTGTCAATAGTAATGTTTTTA 

TTGAAGTTTCTCTTAGTAATGCGTCTGAGAGTGTTTTAACTTTAGAAATAGGCGATATTAATTCTTTTGGCTTTGA 

TTTTGATGTTACTGATACCACCAATATTAAAGTTAA^GACCTATTGAJ^TATGTTAAAAAGAGATCTAAAAATGTT 

GCAATTCCTGTTAGAAATATGAGCTTGAGACCTAATGAAAAATTTTCTGTAGTTATTAAGTTAAATCAATTTGTTA 

AGTTTAGTAAAGATGGAGTTTATTTTGTTAAGGGTATTTTTTTCCCAGACATTTCAGATCCATCTAAGAAAAAAGA 

ATCCAATATTATTACGCTTTTTTTGAATGATGGTTTTGATGAAAATCCAGGTAGCATAGACCTTGTTAATTTGTCT 

GAAAATAATGATATTCAAGATATCTTGAAAAAGAAAAAATTATCTCCCGATGAAATTGTTAAATATTTGTTAAAGG 

CATTGCAGCTTGGGAAAAAAGAAAAGTTCTTTTTATATCTTGATATTGAAGGTTTGTTATTAAATGACAAGGG 

GGCATACCTTT AT AAGC AAAAGTT ATCACCTATTCCCAATAAAAATGTAGTTGP^GAGT AT AAAGAATATTTGTGG 

AATTCTAATAATTCGGATATTTCAAAAGCACCAAATAAATTTTCTATTATTGAAACTACTTATTCTGATACTTCTG 

GCAAGGTGATTGCTGATTTATATTTTGACGATGGGCAATTTTATATTTCCAAAAGATATACTTTCTTCTTTAAAAA 

ATATGATTATTATTGGATAATTTATGATTACATTGTTCAAAATACTGGCATTAAGGAAAAGTAA 



117 * 

TABLE 1. Nucleotide and Amino Acid Sequences 
f2 9.aa 

MNWIjSFFYVLLFLLIFPFELQSNNKENIENLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKY 
KEA1TOFLKKINQKKIKNQKIKNEIISLKLRINEDNINEEEIKKILNNEKNIDVKIIYQIFSLIKFKNKKLANKIKN 
' IILTNYPKSIYSYKIKRNE 



t29.aa 

NNKENIENLIKLHMLYDLTNNLSKELETINKIKNFDLEQHYLLITKYYLKIKKYKEANDFLKKINQKKIKNQKIKN 
EIISLKLRINEDNINEEEIKKILNNEKNIDVKIIYQIFSLIKFKNKKLANKIKNIILTNYPKSIYSYKIKRNE 

f29.nt 

ATGAACTGGCTATCCTTTTTTTATGTTTTATTATTTTTATTAATTTTTCCTTTTGAATTACAGAGTAATAATAAAG 

AAAAT ATAG AAAATTT AAT AAAGC T AC AT ATGCTTT ATG ATTT AAC C AAT AAC C TGTC AAAAG AATT AGAAAC AAT 

AAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAAAATAAAAAAATAT 

AAAGAAGCTAATGATTTTTTAAAAAAAATAAACCAAJVAAAAGATCAAAAATCAA 

CGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTTAAAT 

AGATGTCAAAATAATTTATCAAATATTCAGTCTTAT AAAATTT AAAAATAAAAAATTAGCAAATAAAATT 

AT AAT ACTAACAAACTATCCCAAAAGCATTTATTCTTAT AAAAT AAAAAGAAATGAATAA 

t29 .nt 

AATAATAAAGAAAATATAGAAAATTTAATAAAGCTACATATGCTTTATGATTTAACC 

TAGAAACAATAAATAAAATTAAAAATTTTGACTTAGAACAACATTATCTGCTAATTACAAAATATTATCTAAA 
AAAAAAATATAAAGAAGCTAATGATTTTTTAAAAAAAATAAACCAAAAAAAGA 

GAAATCATTTCGCTAAAATTAAGAATAAATGAAGATAATATTAATGAAGAAGAAATCAAAAAAATTTT 
AAAAAAAT AT AGATGTCAAAATAATTTATCAAATATTCAGTCTT AT AAAATTT 

AATTAAAAACAT AAT ACTAACAAACTATCCCAAAAGCATTTATTCTTAT AAAAT AAAAAGAAATGAATAA 

f290.aa • * . 

MNSIYVIGKLLLTLFLIFFPFCYNLFAVNLAEINKLSEYAKSIVLXDFDTKRILYSKKPNLVFPPASLTKIVTIYT' 
ALIEAEKRNIKLKSIVPISDSASYYNAPPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSSIAIAEFWGNLNSFW 
LMNINVLNLGLFNMHFVEPSGYSSENKITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFLNL^ 
QRNANLLIYDYPYSDGIKTGYIKESGLNLVATAKKGERRLIAVVLGVEKGINGFGEKMRSS IAKNLFEYGFNKYSK 
FPLIVKLKEKVYNGTVDTVALFSKEPFYYILTKDEFDKINISYTVDKLVAPLSGDMPVGRAMIFLENEKIGDVALF 

S GKVKRLG F WQG LYKSFINLFS RE Y 
t.290 . aa 

VNLAEINKLSEYAKSIVLIDFDTKRILYSKKPNLVFPPASLTKIVTIYTALIEAEKRNIKLKSIVPISDSASYYNA 
PPNSSLMFLEKGQIVNFEEILKGLSVSSGNDSSIAIAEFVVGNLNS 

ITALDMAFFVKSYIEKFKFMLNIHSLKYFIYPKSRNLGTALSSKFLNLKQRNANLLIYDYPYSDGIKTGYIKESGL 
NLVATAKKGERRLIAWLGVEKGINGFGEKMRSSIAKNLFEYGFNKYSKFPLIVKLKEKVYNGTVDTVALFSKEPF 
YYILTKDEFDKINISYTVDKLVAPLSGDMPVGPJUyilFLENEKIGDVALFSGKVKRLGFWQGLYKSFINLFSREY • 

f 290 .nt. 

ATGAATAGTATCTATGTTATTGGGAAATTGTTATTAACTTTATTTTTAATTTTTTTCCCGTTTTGTTATAATCTTT 
TTGCAGTTAATTTAGCTGAGATTAATAAATTATGAGAGTATGCAAAGTCAATAGTTTTAATAGATTTTGATACTAA 
GCGAATACTTTATTCTAAGAAGCCCAATTTGGTTTTTCCTCCAGCATCTCTTACAAAGATTGTTACAATTTATACA 
GCTTTAATTGA^GCTGAAAAGCGAAATATAAAATTAAAAAGCATAGTTCCTATTAGCGATTCTGCTTCATATTATA 
ATGCACCCCCCAATTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATTTTAAAAGGACT 
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TTCAGTTCCTTCGGGTAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGCTTTGTTAAT 

t^IVtgLvtattaatgt 

i^l_ig^t"acac^actagatatggctttttttgtgaaatcttatatag 

qq t-t 1 z. ii/^CTTGTT 

GCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTTGTATTGGGGGTTGAAAAAGGAAT 

?AAT^!^ 

TTCCCTTCAATAGTAAAATTAAAAGAAAAAGTCTATAATGGTAC 
CTTTT^AZTATATTTTAACTAAAGA 

ACT^AGTGG<^ATATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATA 

•agt<^;^.ggtaaaaag 

AA ' . 



c250.nc - - 

G^.aji'-^AGCTGAGATTAATAAATTATCAGAGTATGCAAAGTCAATAGTTTTAATAG 

t 2c™^-^ctaagaagcccaaot^ 
aattg^c^ 

CCCCCC ^-A -TTCTTCTTTGATGTTTTTAGAAAAAGGTCAAATTGTTAATTTTGAAGAGATTTTAAAAGGACTTTCAG 
TT^CTTO^TAATGATTCTTCTATTGCAATTGCTGAGTTTGTAGTAGGCAATTTAAATAGC 

gaatattL^^ 

at^acac^^xtagatatggctttttttgtgaaatcttatatagaaaagtttaaatttatgcttaatattcattctt 
tVJj^gtattttatttatccaaagagta^ 

L^TC^^-^.TTTATTAATATATGATTACCCTTATTCAGATGGCATTAAAAC 
aj^CT^G^GCTACTGCTAAAAAGGGTGAGAGAAGATTAATAGCAGTT 

g^ttc<^gagaaaatgag 

TT -^t- G ^AAAATT^^ 

rp^^ * a .^ TTTAAC taaagatgaatttgataaaattaatat AAGTTATACTGTTGAT AAATTGGTTGCTCC ACTTA . 
GTC^^ACATGCCTGTTGGGAGGGCTATGATTTTTTTAGAAAATGAAAAAATAGGGGATG 

CAAGGTAAAAAGATTAGGGTTTTGGCAAGGTCTTTATAAGAGTTTTATAAATCTTTTTTCAAGAGAGTATTAA 
f 291 .dd - ' * . ■ ' 

>^syd?i?alvpiiliiiglgiikkpayyv^ 

*T^<MSED<2KDIETIKNILSNVSSDRRIIVL^ 

TSSTAYGSVGIPITSIAQATNLDVNIVSSEIAFQLILPTLTIPFVLVILTGGGIKGLKG . 

FISKTLGPSLPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIVTFIVLVSPLFNKIHEY 

LK^FQS^ZSIYPEANPLHFKWIISPGFLIILATTISYSIRGVPMLKQLKIFTLTLKKMALSSFIIICIVAISRLMT 

HSGI4I5X)LAJSIGISIITGKFGPLFSPLIGAIGTFLTGSDTVSNVLFGPLQTQMAENIGA 

I S PQNIT I ATTTAGL IGQEGKLLSKTI IYALYY I LATGLLVYL V 

.c2Sl.aa- * ' ' ' '. 

QKDIETIKNILSNVSSDRRIIVLLVAWGFGNFLEGVAGYGTAVAIPVSILIAMGFEPFFACIjICLIMNTSSTAYGS 

VGI P ITS LAQ ATNLDVNI VS S E I AFQL I L PTLT I PFVLVI LTGGGI KGLKGVFLLTLLSGMSMAI SQVF I SKTLGP 
ELPAILGSILSMTITIVYARFFGNKETTERQSKNTISLSKGIIACSPYILIWFIVLVSPLFNKIHEYLKTFQSTI 
SIYPEANPLKFKWI ISPGFLI ILATTISYSIRGVPMLKQLKIFTLTLKKMALSSFIIICIVAISRLMTHSGMIRDL 

ANGISIITCKFGPLFSPLIGAIGTFLTGSDTVSJ^ 

ATTTAGL ICQ SGKLLSKTI I YALYY I LATGLLVYL V . . ■ 

£291. nt 

ATGAATTCTTATGATTTTATAACAGCTTTGGTACCAATAATCCTAATAATTATTGGACTTGGCATAATAAAAAAGC 
C^GC^AC^ATGTAATACCCATATGATTAATAGCCACCGTTGCTATAGTTATATTTTATAAAAACTTGGGAATAGT 
^ C i G \i g^CTTGC AATGCTTGAGGGCGCGTT AATGGGGAT ATGGCCAAT AGCAACTGTAATT ATTGCTGCC ATA 
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TTTACATACAAAATGTCAGAAGATCAAAAAGATATAGAAACTATTAAAAATATTTTATCAAACGTATCTTCTGATA 
GAAGAATTATAGTATTACTAGTTGCATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGT 
TGCAATTCCTGTATCAATATTAATAGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAATGAAC 
ACCTCATCAACCGCCTACGGATCTGTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACA 

•TTGTTTCATCTGAGATTGCATTCCAACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGG 
AGGGGGCATTAAAGGATTAAAAGGAGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTA 
TTTATATCAAAAACTTTGGGTCCAGAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTT 
ATGCAAGGTTTTTTGGAAATAAAGAAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTAT 
TGCCTGCTCACCCTACATTTTAATAGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATAC 
CTAAAAACTTTTCAAAGCACTATTAGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGG 

• GCTTCTTGATTATACTTGCAACAACAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATT 
TACATTAACCTTGAAAAAAATGGCATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACA 
CATAGTGGAATGATAAGAGATCTTGCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCAC 
TAATTGGAGCTATTGGGACATTTTTAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACA 
AATGGCAGAAAATATTGGAGCAAATCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATG 
ATTTCTCCCCAJVAACATCACAATAGCAACAACAACTGCTGGATTAATTGGACAAGAAGGCAAGCTTTTATCAAAAA 
CAATAATTTATGCTTTA^ACTACATTTTAGCAACAGGATTGCTAGTTTATTTAGTATAA 



t29l'.nt * 

CAAAAAGATATAGAAACTATTAAAAATATTTTATCAAACGTATCTTCTGATAGAAGAATTATAGTATTACTAGTTG 

CATGGGGATTTGGAAATTTTTTAGAAGGAGTTGCTGGATATGGAACTGCTGTTGCAATTCCTGTATCAATATTAAT 

AGCAATGGGATTTGAACCATTTTTTGCCTGCTTAATCTGTTTAATAATGAACACCTCATCAACCGCCTACGGATCT 

GTGGGAATCCCTATAACATCTTTAGCTCAAGCAACTAACTTGGATGTTAACATTGTTTCATCTGAGATTGCATTCC 

AACTAATACTTCCAACCTTAACAATACCTTTTGTACTGGTAATTCTTACAGGAGGGGGCATTAAAGGATTAAAAGG 

AGTATTCCTTCTTACCTTACTCTCAGGAATGTCAATGGCAATATCTCAAGTATTTATATCAAAAACTTTGGGTCCA 

GAACTTCCTGCAATCCTTGGAAGCATTCTTTCTATGACAATAACAATAGTTTATGCAAGGTTTTTTGGAAATAAAG 

AAACTACTGAGCGCCAAAGCAAAAACACAATATCCTTATCAAAAGGAATTATTGCCTGCTCACCCTACATTTTAAT 

AGTAACTTTTATAGTGCTTGTATCTCCTCTTTTTAACAAAATTCATGAATACCTAAAAACTTTTCAAAGCACTATT 

AGCATTTATCCAGAAGCAAATCCCTTACACTTTAAATGGATTATCTCTCCGGGCTTCTTGATTATACTTGCAACAA 

CAATATCCTATTCAATACGGGGAGTTCCAATGTTAAAACAGCTAAAAATATTTACATTAACCTTGAAAAAAATGGC 

ATTATCTTCCTTTATAATCATATGCATTGTTGCAATATCAAGATTAATGACACATAGTGGAATGATAAGAGATCTT 

GCTAATGGAATCTCAATAATAACAGGTAAATTTGGACCATTATTTAGCCCACTAATTGGAGCTATTGGGACATTTT 

TAACAGGAAGTGATACGGTTTCAAATGTTCTTTTTGGACCTTTACAAACACAAATGGCAGAAAATATTGGAGCAAA 

TCCTTACTGGCTTGCAGCAGCAAATACAACAGGAGCAACTGGAGGGAAAATGATTCTCCCCAAAACATCACAATAG 1 

CAACAACAACTGCTGGATTAATTGGACAAG 



f296,aa . - 

MPSPIRVFFLVLLFIFIFNPVLIAI^LFILFPFILILFSFLGVFRIYFTRDYSYSRSREFEFYKLSFLLMAKLLSIL 
GTVTGEQLNYVNF I INS LNLS ERGKS EL YT I FHS AITKNNNADKI L YTLKLGYFQHKDLF I WLF ATLKE I NRLSRY 
KNLEAEKFISYVGVFLELESDGYEAYKDINIKIVNPYSVLGLTYSASDDEVKKAYKSLVIKYHPDKFANDPVRQKD 

ANDKFIKIQDAYEKICKERNIR 
t296.aa 

IYFTRDYSYSRSREFEFYKLSFLLMAKLLSILGTVTGEQLNYVNFIINSLNLSERG 

I L YTLKLGYFQHKDLF I WLF ATLKE I I^LSRYKNLEAEKF I SYVGVFLELESDGYEAYKD I NIKIVNPYSVLGLTY 
SASDDEVKKAYKSLVIKYKPDKFANDPVRQKDAJJDKFIKIQDAYEKICKERNIR 

f296.nt ' 

ATGCCAAGCCCXATTAGAGTGTTTTTTTTAGTGTTGTTGTTTATTTTTATTTTTAATCCCGTTTTAATAGCAATGC 
TTTTTATTTTATTTCCTTTTATTTTGATATTATTTAGTTTTTTAGGTGTTTTTAGAATATACTTTACAAGGGATTA 
CTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATTAATGGCTAAATTGCTATCTATTTTA 



120 

TABLE 1. Nucleotide and Amino Acid Sequences 



'ggaactgtaactggggagcagctaaattatgtcaattttattatcaattctttgaatttgtctgaacgtggtaaat 
cagaattgtataccatttttcattctgctattactaaaaataataatgctgataaaattttatatacccttaagct 
tggttattttcagcacaaagatctttttatatggctttttgccactcttaaagaaattaacaggctttctaggtat 

AAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGAATCTGATGGTTATGAAGCTT 
ATAAAGATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATATAGTGCTAGCGATGATGAGGT 
TAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATGATCCTGTAAGACAAAAAGAT 
GCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAi^GAAATATAAGGTAA 

t296.nt 

A.TATACTTTACAAGGGATTACTCATATTCTAGATCTAGAGAGTTTGAATTTTATAAACTTTCTTTTTTATTAATGG 
CTAAATTGCTATCTATTTTAGGAACTGTAACTGGGGAGCAGCTAAATTATGTCAATTTTATTATCAATTCTTTGAA 
TTTGTCTGAACGTGGTAAATCAGAATTGTATACCATTTTTCATTCTGCTATTACTAAAAATAATAATGCTGATAAA 
ATTTTATATACCCTTAAGCTTGGTTATTTTCAGCACAAAGATCTTTTTATATGGCTTTTTGCCACTCTTAAAGAAA 
TTAACAGGCTTTCTAGGTATAAAAATTTAGAAGCTGAAAAATTTATTTCTTATGTTGGTGTTTTTTTAGAACTTGA 
ATCTGATGGTTATGAAGCTTATAAAjjATATTAATATTAAAATTGTAAATCCTTATAGTGTTTTGGGGTTAACATAT? 
AGTGCTAGCGATGATGAGGTTAAAAAGGCGTATAAAAGCCTTGTTATAAAATATCATCCTGATAAGTTTGCAAATG 
ATCCTGTAAGACAAAAAGATGCAAATGATAAATTTATAAAAATTCAAGATGCTTATGAAAAAATTTGCAAGGAAAG 
AAATATAAGGTAA " ' 

f3.aa 

MKKKNIjSIYMIMLISLLSCNTSDPNELTRKKMQDK^TVICILGFLEKIQADNKEIVEKHIEKK 
ESNFPYYLQEEIEIKEEELVPNTDEEKKAEKAISDGSLEFAKLTO^ 

AEVHVAGRINSYIKKRKTTKEKEYKKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLSERASAKYFFEKA 
. KETLKAAITERLNNKRKJSTRPWWARRTHSNLAIQAKNEAEDALNQLSTSSFRILEAMKIKEDVKQLLEEVKSFLDS^ 
KSKIFSSGDRLYDFLETSK ■ ' 

t3 .aa . ■' " ' 

NELTRKKMQDKNVKILGFLEKIQADNKEIVEKHIEKKEKQMVQAASVAPINVESNFPYYLQEEIEIKEEELVPNTD 

EEKKAEPCAISDGSLEFAKLVT)DENKLKNESAQLESSFNNVYKEILELADLIQAEVHVAGRIN 

KKREIKNKIEKQALIKLFNQLLEKRGDIENLHTQLNSGLS 

RTHSNLAIQAKNEAEDALNQLSTSSFRILEAMKXKEDVKQLLEEVKSFLDSSKSKIFSSGDRLiYDFLETSK 
f3.nt 

ATGAAAAAAAAAAATTTATCAATTTACATGATAATGCTAATAAGTTTATTATCATGTAATACAAGTGACCCCAATG 
• AATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTAGGATTTTTAGAGAAAATTCAAGCAGATAATAA 
AGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATGGTGCAGGCTGCTTCTGTAGCACCTATTAATGTA 
GAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGATGAAG 
AAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAACTTAA 
AAATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAATACAA 
GCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCACTAAAGAAAAAGAATATAAGA 
AGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAGGCGA 
TATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAAAGCC 
AAAGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATAACAAACGTAAAAATCGGCCATGGTGGGCAAGAAGAA 
CA.CATAGTAJ^TTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTTTTAG 
GATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTCTTCA 
AAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

. t3 .nt 

^AATGAATTAACTCGTAAAAAAATGCAAGACAAGAACGTGAAAATTTTA.^ 
ATAAAGAAATTGTTGAAAAACATATAGAAAAAAAAGAAAAACAAATG^ 

TGTAGAGAGTAATTTCCCATATTATCTTCAAGAAGAAATAGAGATAAAAGAAGAAGAGTTGGTTCCAAATACTGAT 
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GAAGAAAAGAAGGCAGAGAAGGCAATTAGCGATGGGAGTCTTGAATTTGCTAAATTAGTTGATGATGAAAATAAAC 
TTAAAi^ATGAATCTGCGCAATTAGAATCTAGTTTTAATAATGTTTATAAAGAAATCTTAGAACTTGCAGATTTAAT 
ACAAGCAGAGGTGCATGTTGCAGGAAGGATAAATAGCTATATAAAAAAAAGAAAGACCA 

AAGAAGAGAGAAATTAAGAATAAGATAGAAAAACAGGCTCTAATTAAGTTGTTCAATCAGTTATTAGAAAAAAGAG 
GCGATATTGAAAATCTTCATACTCAATTAAATAGTGGACTTAGCGAGAGAGCATCTGCAAAATACTTTTTTGAGAA 
AGCCi^JVGAAACTTTAAAAGCTGCTATTACTGAAAGATTAAATJ^ 

AGAACACATAGTAATTTAGCAATACAGGCAAAAAATGAGGCAGAGGATGCTTTAAACCAATTAAGTACTTCTTCTT 
TTAGGATACTTGAAGCAATGAAAATAAAGGAAGATGTAAAACAGCTTCTTGAAGAAGTAAAATCTTTTCTAGATTC 
TTCAAAGAGCAAAATCTTTTCTAGTGGCGATAGATTATATGATTTTTTAGAGACGAGTAAATAA 

' f 30 .aa ' 

MNKKILTLLVTjILSISSVLMIiSKSITKKSKYKIIRDYFINSNYVLVKIENKDLKFTISKPIYDKKLNNYFFKGQTT 

SHFLISNNVDIAINTSPYEVKQNMFFPKGLYIYNKKMISKQINNYGEIVIKHNKIILNPKEDEIENCDYGFSGFFV 

LIKNGKYKKNFKETRHPRTIIGTDKNNKHLFLVTIEGRGV^ 

KSNNAPYKLNFTANIFGQERPVPFHLGIKLPN 

t3 0.aa 

LSKSITKKSKYKIIRDYFINSNWLVKIENKDLKFTISKPIYDKKLNNYFFKGQTTSHFLISNNVDIAINTSPYEV 
KQNMFFPKGLYIYNKKMISKQINNYG^^ 

IGTDKNNKHLFLVTIEGRGVNNSKGASLNEAIDFALSYGMTNAINLDGGGSSTLVVKSNNAPYKLNFTANIFG 
PVPFHLGIKLPN 

f30.nt ' 

ATGAATAAAAAAATATTAACACTGCTAGTATTGATTTTAAGTATTTCATCAGTACTAATGCTGTCCAAATCAATCA 
CCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTC^^ 

AGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTTTAAAGGCCAAACAACA 
AGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTTAAACAAAACATGTTTT 
TC C C AAAAGGAC T ATAC ATAT ATAAT AAAAAAATGATTTC AAAAC AAAT AAATAAC TAC GGAGAG ATTGT AAT AAA 
GCACAACAAAATTATATTAAATCCCAAGGAAGACGAAATAGAAAACTGCGATTATGGATTTAGCGGATTTTTTGTT 
TTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACAAGGCACCCAAGAACAATAATA 

ATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGGCCTCTCTTAATGAAGC 
.TATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTCAAGCACTCTTGTTGTA 
AAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGGACAGGAAAGACCTGTCCCATTTCATT 
TAGGAATAAAACTTCCTAATTGA 

t30.nt ; 

CTGTCCAAATCAATCACCAAAAAATCCAAATACAAAATTATTAGGGATTATTTCATAAACAGCAATTATGTTCTGG 

TGAAAATTGAAAATAAAGATCTAAAATTTACCATATCAAAACCTATTTACGACAAAAAGCTAAATAATTACTTCTT 

TAAAGGCCAAACAACAAGCCATTTCTTAATTTCTAACAATGTTGACATTGCAATTAACACAAGTCCATACGAAGTT 

AAACAAAACATGTTTTTC CC AAAAGGAC TAT AC AT AT AT AAT AAAAAAATGATTTC AAAAC AAAT AAAT A^ 

GAGAGATTGTAATAAAGCACAACAAAATTATATTAAATCCCAAGGAAGACGi^VATAGAAAACTGCGATTATGGATT 

TAGCGGATTTTTTGTTTTAATCAAAAACGGAAAGTATAAAAAAAATTTTAAAGAAACi^ 

ATAGGAACTGATAAAAATAACAAGCATTTATTTCTTGTTACAATAGAAGGAAGGGGTGTCAATAATAGCAAAGGGG 
CCTCTCTTAATGAAGCTATTGATTTTGCATTAAGCTACGGCATGACTAACGCTATTAATCTAGACGGGGGGGGCTC 
AAGCACTCTTGTTGTAAAATCAAATAACGCTCCTTACAAATTAAACTTCACAGCAAACATCTTTGGACAGGAAAGA 
- CCTGTCCCATTTCATTTAGG AAT AAAAC TTCCTAATTGA ; * 

f308.aa 

MQLLKNKYPFKRALLDLFLWAIVYLASPFVNVNSEFWNVDENHFYFWISRSFLII 
IPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFLLEYLLPESVL 
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EELFYRAFVITKFTQMGFPWATAILSSMFFAYGHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIHSFYNII 
VSSLLLFLN 

t308.aa 

NSEFWNVDENHFYFWISRSFLIIFIIYFFKLTSSYDDFRVEFFIPKFKFIFLWDSVLIFIKTILIAMIVIFLIAFL 
LEYLLPESVLVYYFQNNAGFNWKI3SKKAFFLMTFTSFFTGAFEELFYRAFVITKFTQMGFPWATAILSSMFFAY 
GHLYYGILGFLVTFILGIFFAFTYLRYKNVYYVIFIHSFYNIIVSSLLLFLN 

•f 308 .nt 

ATGC AATTGTT AAAAAATAAATATCC ATTC AAGCGGGC TTTGC TTGATC TTTTTTTGGTC T ATGC T ATTGTTT ATT 
TGGCATCTCCTTTTGTAAATGTTAATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAG 
ATCTTTTTTAATTATTTTTATAATTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTT 
ATTCCTAAATTTAAATTTATTTTTCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAG 
TCATTTTTTTAATAGCTTTTTTGCTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGC 
TGGATTTAATTGGAAGATTAGCAGTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTT 
GAAGAACTTTTTTACAGGGCTTTTGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTC 
TTAGTAGTATGTTTTTTGCTTATGGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGAT 
ATTTTTTGCTTTTACTTATTTAAGGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATT 
GTTAGCAGCTTGTTGCTTTTTTTGAATTAA 

t308.nt 

AATTCAGAATTTTGGAATGTTGATGAAAATCATTTTTATTTTTGGATTTCAAGATCTTTTTTAATTATTTTTATAA 
TTTATTTTTTTAAACTTACCAGTTCTTATGATGATTTTAGAGTAGAGTTTTTTATTCCTAAATTTAAATTTATTTT 
TCTTTGGGATTCTGTTTTAATTTTTATTAAAACAATATTGATTGCAATGATAGTCATTTTTTTAATAGCTTTTTTG 
CTTGAATATTTGTTGCCAGAATCGGTACTTGTCTATTATTTTCAAAACAATGCTGGATTTAATTGGAAGATTAGCA 
GTAAAAAAGCATTTTTTTTAATGACTTTTACCTCTTTTTTTACAGGAGCTTTTGAAGAACTTTTTTACAGGGCTTT 
TGTTATTACTAAGTTTACACAAATGGGATTTCCTGTTGTAGCTACCGCCATTCTTAGTAGTATGTTTTTTGCTTAT 
GGGCATTTATATTATGGAATTTTAGGATTTTTGGTTACATTTATATTAGGGATATTTTTTGCTTTTACTTATTTAA 
GGTATAAAAATGTATATTATGTGATTTTTATACATAGTTTTTATAATATTATTGTTAGCAGCTTGTTGCTTTTTTT 
GAATTAA 

f 31. aa ■ T " 

MKKYLFFILFLISSNNLIVSYPLSFGGGFSYQFTNYTDKTGATKFAPNFTRADHGINLNLFFDANYVLFEMSYKEA 
FVVTHNGRYFSLGLYGTYPMVFKEQVRMLFPLIGFKYAFDLSSNNFNLFFLSMGL^DLFIPDLDGLYIRPLFML^ 
ISPFSNYKNFSGLTTEIMLGFNIGWRFFN 

t31-.aa 

IVSYPLSFGGGFSYQFTNYTDKTGATKFAPNFTRADHGINLNLFFDANYVLFEMSYKEAFVVTHNGRYFSLGLYGT 
YPMVFKEQVRMLFPLIGFKYAi^DLSSNNFNLFFIjSMGLAADLFIPDLDGLYIRPLFMLSISPFSNYKNFSGLTTEI 
MLGFNIGWRFFN 

f31.nt ' " • ( • 

ATGAAGAAATATCTTTTTTTTATTTTATTTCTCATCTCTTCTAATAATTTAATTGTTTCTTATCCACTTTCTTTTG 
GTGGAGGTTTTTCTTATCAATTTACTAATTATACTGATAAAACAGGCGCCACTAAATTTGCTCCAAATTTTACCAG 
AGCAGATCATGGGATTAATTTGAATTTATTTTTTGATGCAAATTATGTACTTTTTGAAATGTCTTACAAAGAGGCT 
TTTGTTGTTACTCACAATGGGAGATATTTCTCGCTTGGGCTTTATGGAACATATCCAATGGTTTTCAAAGAGCAGG 
TTAGAATGCTTTTCCCATTAATTGGGTTTAAATATGCTTTTGATTTAAGCTCTAATAACTTCAATCTCTTTTTTTT 
AAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTGATGGTTTATATATTAGGCCTTTGTTTATGCTTTCT 
ATTTCTCCATTTTCTAATTATAAAAATTTTTCTGGGTTAACAACTGAGATTATGCTTGGATTTAATATCGGTTGGA 
GATTTTTCAATTAG - v. 
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t31 .nt 

ATTGTTTCTTATCCACTTTCTTTTGGTGGAGGTTTT 

CTAAATTTGCTCCAAATTTTTACCAGAGCAGATCATGGGATTAATTTGAATTTATTTTTTTGAjX^ 

T TTTGAAATGTCTTACAAAGAGGCTTTTGTTGTTACTCACAATGC<7AGA^ 

TATCCAATGGTTTTCAAAGAGCAGGTTAGAATGCTTTTCCCATTAATTGGGTT^ 

CTAATAACTTCAATCTCTTTTTTTTAAGCATGGGGCTTGCTGCTGATCTTTTTATTCCCGATCTTGACOGTTTAv.-. 
T ATTAGGC CTTTGTTT ATGC TTTCTATTTC TC C ATTTTC T AATTATAAAAJ\.TTTCTCTGOGT-TAAC AAC "GAGA" T 
ATGCTTGGATTTAATATCGGTTGGAGATTTTTCAATTAG 



f 939 .aa 



MKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNL^ 

IEKIDLSNSYEFINDIWISGKTYLi^QNKEEELEVCELNGKW 

NNREKIFDLQGSDKTPPQATENDKFYQISNEENLITC^ 

VMTGG YNN L DT K F K VY SNTNNY TTPIFIQD EVG E F S S Y F AR E F ND A I L I G S NNG 7 A_£ 77 7T n<7 G I 7 A1.7AJ? 3 K SVE 
PGAYNGSQLSKTGLNDIIPVSNNTIYILTQGKGLWKLENRKLTKE 

f939.aa ' ' 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFK^ 

LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFL^ 

FYQISNEENLITGNSLKIWQfcnXTNNT^^ 

PIFIQDEVGEFSSYFAREFNDAILIGSNNGFAEFTKNKEGIFAL^ 

IYILTQGKGLWKLENR 

KLTKE 

f939.nt 

ATGAAACAAAAATACGAAAACTATTTTAAAAAAAGATTAATTTTAAACC^ 
CAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAJVCATGAA/TAXA 

TCC AAGAGGAATTTCTC T AGT AGGAG AAAC TC TCTAC ATTGC AGC C ATGC ATTT ATTT A^^^-AAG A-AAACGGC AAG 
ATTGAAAlAAATTGATTTGAGCAATTCTTATGAGTTTATAAA.CGACATTGTAAATATATCTGGAAAA 

t agc gc aaaac aaag aag aag aatt ag aag t ttgc g agc t aaatgg aaaag attgg ac att aaj^jattt aa-aaaac c 
• gctaaaagcatataaattcttaaaatccgtaggaagagatggcgt^^ 

aat aatcgtgag aaaatttttgatc t ac aaggatc tg ac aaaac ac c ac c ac aagc ta.ctg aaaatg ac a_-_atttt 

atcaaatatcaaatgaagaaaacttaattacaggaaatt^ 

aaacatagactatcaacaggccaaagaaataatgcctatcattaaaacaj^^^ 

gtaatgactggtggttacaataatttagatacaaaatt^^ 

tatttattcaagacgaagtaggcgaatttagcagctactct 

taataatggatttgcagaatttacaaaaaataaagaaggaat^^ 

cctggagcttataacggatctcagctaagcaaaacaggcctt^^ 

acatattaactcagtccaagggtttgtggaaattggaaaacagaaaattaactaaagajv^ 

t939.nt ' - 

t g c t c aag c g aa.t c c at attt t c ac aatt agg aaatc tgc aaaajvat aaaa.c a/tg aa 7 ac a_at a'ttttgggc ag 7 t 

caagtccaagaggaatttctctagtaggagaaactctctacattgcagccatgcatt^ 

caagattgaaaaaattgatttgagcaattcttatgagtc^ 

cttttagcgcaaaacaaagaagaagaattagaagtttg^^^ 

aaccgctaaaagcatataaattcttaaaatccgtaggaagagatggcgtaa.^ 

t aaaaat aat c gtg ag aaaatt tttg atc t ac aagg a.tc tg ac aaaac ac c ac c ac aa.gc t ac 7ga_aaa;7 ga.c aaa. 
tttt atc aaat atc aaatg aa.g aaaac tt aatt ac agg aaatt c ac tc aa-aa.t atggc a.aa.t g aa. t aa.c a_a7 ac a 7 
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ACACAAACATAGACTATCAACAGGCCAAAGAAATAATGCCTATCATTAAAACAAGCATTAGGGGCTCTTCTGAAGT 
TTTAGTAATGACTGGTGGTTACAATAATTTAGATACAAAATTTAAAGTTTACTCAAATACAAATAATTACACAACG 
CCAATATTTATTCAAGACGAAGTAGGCGAATTTAGCAGCTACTTTGCAAGAGAATTTAATGATGCGATATTAATCG 
GAAGTAATAATGGATTTGCAGAATTTACAAAAAATAAAGAAGGAATTTTTGCCCTACGGGCACCCTCAAAATCTGT 
AGAACCTGGAGCTTATAACGGATCTCAGCTAAGCAAAACAGGCCTTAATGATATTATTCCTGTATCAAACAACACG 
ATTTACATATTAACTCAGGGCAAGGGTTTGTGGAAATTGGAAAACAGAAAATTAACTAAAGAATAA 

£739. aa" ' 

MQSGLKIKLILFFCCFACSCDINYPEIKELDYKINYYFTENRLDYSMSFDFAIKVINSKDVFKLSIENKNTNEFIQ 

VINNNYSSFFIDSSLGKDILYCKDLRFNFFDKTFEDFTSCVRLFD^^ 

KDMEMLNKLSNSKVFFVKTYKDKLHPVSSVVRIDSIDILEID 

t739.aa 

CCFACSCDINYPEIKELDYKINYYFTENRLDYSMSFDFAI^ 

SLGKDILYCKDLRFNFFDKTFEDFTSCWLFDKGmVYNRELVISLGMSKYDLDDVHXTYVYKSKDMEMLNKLSNSK 
VFFVKTYKDKLHPVSSWRIDSIDILEIDKAFDNYXSFYYVEKNSNLFFKVG 

f739.nt 

ATGCAGAGCGGATTAAAAATTAAATTAATATTGTTTTTTTGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGG 
AGATAAAAGAGCTTGATTATAAGATAAATTATTATTTTACTGAAAATCGCTTAGATTACTCTATGAGTTTTGATTT 
TGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATTATCAATAGAGAATAAGAACACTAATGAGTTTATTCAA 
GTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCTAGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGA 
GGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCTCATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATA 
CAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATATGATTTAGATGATGTTCACAATTATGTATATAAGTCT 
AAAGATATGGAAATGTTAAACAAGT.TAAGCAATTCCAAAGTATTTTTTGTTAAAACTTATAAAGACAAACTACATC 
CGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTCTAGAGATTGATAAAGCATTTGATAATTACATAAGTTT 
. TTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGTTGGCTGA - 

t739."nt ' . " 

TGTTGTTTTGCTTGTTCTTGCGACATAAATTATCCGGAGATAAAAGAGCTTGATTATAAGATAAATTATTATTTTA 
CTGAAAATCGCTTAGATTACTCTATGAGTTTTGATTTTGCAATTAAAGTTATAAATTCAAAAGATGTTTTTAAATT 
ATCAATAGAGAATAAGAACACTAATGAGTTTATTCAAGTGATTAATAATAATTATAGCTCTTTTTTTATTGATTCT 
AGCCTTGGAAAGGATATTCTATATTGTAAGGATTTGAGGTTTAATTTTTTTGATAAAACTTTTGAAGATTTTACCT 
CATGTGTTCGTCTTTTTGATAAGGGCATGAGAGTATACAATAGAGAGCTTGTTATTTCTTTGGGTATGTCAAAATA 
TGATTTAGATGATGTTCACAATTATGTATATAAGTCTAAAGATATGGAAATGTTAAACAAGTTAAGCAATTCCAAA 
GTATTTTTTGTTAAAACTTATAAAGACAAACTACATCCGGTCTCTTCAGTTGTTAGAATTGATTCAATAGATATTC 
TAGAGATTGATAAAGCATTTGATAATTACATAAGTTTTTATTATGTCGAAAAAAATTCAAATCTTTTTTTTAAAGT 
TGGCTGA . 

f742.aa '* 

MNKKHTNFSVLLLLIFLLILSFGGFGYYIYQSKLNDKNREIMLNEVKKSVIDRN^ 

MLTNTLAEIANSSPFESKDLQRDSANQILDKIKGQDNTKTNVNENFDIAFNNRYIKDSTITENYSDRNDDVGIEDE- 

DISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSVNDQKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTN 

LSKEKNSENILKTPDNSKYSNNNNTTSLKKISSNSQKESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRV 

TLGKNRLKSLIKKGLSNKFQKVNELIENSKNKEASNLLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLE 

DLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNP^AQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQ 

AIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQ 

AQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKD^ 

DLENTKSRQQAIKDLNEFXKNNPNDAQASKTLAQANKIQKLEDLKSKVHSIKPIDLENTKSRQQAIKDl^EFLKI^ 
PNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDL^ 

AENAYEKIIKLTNTQEDHYKLGIIRFKLKKYEHSIESFDQTIKIjDPKHKKA.LHNKGIALMMLNKNKKAIESFEXAI 
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QIDKOTGTAYYQKGIAEEK^GDMQQAFASFKNAYNLDKNPNYALKAGIVSNNLGNFKQSESYLNFFNANAKKPNEI 
AIYNLSIAKFENNKLEESLETINKAIDLNPEKSEYLYLKASINLKKENYQNAISLYSLVIEKNPENTSAYINLAKA 
YEKSGNKSQAI STLEKI INKNNKLALNNLGI LYKKEKNYQKAIEIFEKAI INSDI EAKYNLATTLIEINDNTRAKD 
LLREYTKLKPNNPEALHALGIIEYNENNNDQTLRELIKKFPNYXKNENIKKIIGI 

t742.aa 

KLNDKNREIMLNEVKNSVIDRNYKKAYSVAKLLQDKYPQNEDIAM^ 
KGQD ■ 

NTKT^^raENFDIAFNNRYIKDSTITE^^^SDR^^DDVGIEDEDISEFKKSKIPEKIKPNTNPKEEDQIIQSPNPKLSV 
NDQKNLFNLEKLKKNLSGKSNSENILNDSQKIENDKQNTNLSKEKNSENILKTPDNSKYSNNNNTTSLKKISSNSQ 
KESELSPPSQTIIGKIYRPYSYLIKKELYEILDDINTGRVTLGKNR-LKELIKKGLSNKFQKVNELIENSKNKEASN 
LLLTLIKKDIEPNLINIPKDPYKKEIFQLDKEDKKPQYLEDLKSKVHSIKPIDLENTKSRQQAIKDLNEFLKNNPN 
DAQASKTLAQANKIQHLEDLKSKVHSIRPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHL'EDLKS 
KVHSIKPIDLENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLENTKSRQQAIKD 
LNEFXKMSFPNDAQASKTLAQANKIQHLEDLKSKVHSI^ 

KIQHLEDLKSKVHSIKPIDIjENTKSRQQAIKDLNEFLKNNPNDAQASKTLAQANKIQHLEDLKSKVHSIKPIDLEN 

tksrqqaikdlnefxknnpndaqasktlaqayenngdllkaenayekiikltntqedhyklgxirfklkkyehsie 
sfdqtikldpkhkkalhnkg1almmlnknkkaiesfekaiqidknygtayyqkgiae 

dknpnyalkagivsnnlgnfkqseeylnffnanakkpneiaiynlsiakfennkleesletinkaidlnpekseyl 
ylkas inlkkenyqnai s l ys l vi eknp ents ay inlaka yeksgnksqai stleki inknnklalnnlgi lykke 
knyqkaieifekaiinsdieakynlattlieindntrakdllreytklkpnnpealhalgi ieynennndqtlrel • 
ikkfpnykknenikkiigi 

f'742.nt 

ATGAATAAAAAACATACAAATTTTTCGGTATTATTGCTTTTAATTTTCTTACTTATCTTATCATTTGGGGGCTTTG 
GTTACTATATATATCAAAGCAAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAAT 
AGATCGAAACTATAAAAAAGCATATTCTGTTGCAAAACTT 

ATGCTTACAAATACACTAGCAGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTA 

ATCAAATCTTAGACAAGATCAAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAA 

TAATAGATACATTAAAGACAGCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATGAA 

GACATATCTGAATTTAAAAAAAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAA 

TAATACAATCTCCAAATCCGAAATTAAGTGTT 

TTTAAGTGGAAAATCAAATAGTGAAAATATTTTAAACGATTCTCAA 

TTATCCAAAGAAAAAAATTCGGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTA 
CATCTTTAAAAAAAATTTCTTCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAA 
AATTTATAGGCCATATAGCTACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTC 
ACACTTGGAAAAAACAGATTAAAAGAATTAATTAAAAAAGGTC 

TTGAAAATTCAAAAAATAAAGAAGCTTCAAATTTACTACTAACCTTAATAAAAAAAGATATTGAACCAAATC 

TAATATACCAAAAGATCCTTACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAG 

GACCTTAAATCTAAAGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 

ATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA* 

ACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAA 

GCCATTAAGGATCTAAACGAATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTA 

ATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATC 

ACGCCAACAAGCCATTAAGGATCTAAJVCGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTA 

GCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAA 

ACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTA 

AAACTTTAGCTCAAGCTAATAAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGAT 

CTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAG 

GCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAAC 

CCATTGATCTTGAAAACACAAAAT 

CACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTT 
AGCTCAAGCTAATAAAATACAACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAA 
AACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCCAGGCCTC 



126 

TABLE 1. - Nucleotide and Amino Acid Sequences 



TAAAACTTTAGCTCAAGCTTATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAA 
CTCACAAATACCCAAGAAGATCACTATAAACTTGGAATCATTAGATTCAAGCTTAAAAAGTATGAACACTCAATAG 
AATCATTTGATCAAACAATAAAACTCGACCCAAAACATAAAAAAGCACTTCATAACAAAGGAATAGCTTTAATGAT 
GCTAAATAAAAACAAAAAAGCAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTAC 
TACCAAAAAGGAATAGCAGAAGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATC 
TCGACAAAAACCCCAATTATGCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGA 
GTATTTAAATTTTTTTAATGCCAATGCAAAAAAACCTAACGAAATTGCTATTTACAACCTATCAATAGCAAAATTT 
GAAAACAATAAACTTGAAGAATCTCTTGAAACAATAAACAAAGCCATAGATTTAAATCCAGAAAAAAGTGAATATT 
TATATTTAAAAGCATCTATAAATCTTAAAAAAGAAAATTACCAAAATGCTATATCACTTTACAGCTTAGTAATTGA 
AAAAAACCCTGAAAATACTTCAGCCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGAAATAAAAGTCAAGCA 
ATCTCAACTCTTGAAAAGATAATAAACAAAAATAATAAATTAGCCTTAAACAATCTTGGGATACTTTACAAAAAAG 
AAAAAAATTATCAAAAAGCAATTGAAATTTTTGAAAAAGCAATAA 

TGCAACCACTCTAATTGAAATTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCA 
AACAATCCAGAGGCCTTACATGCACTAGGAATAATAGAATATAATGAAAATAACAATGATCAAACACTAAGAGAAC 
TTATAAAAAAATTTCCAAATTACAAAAAAAATGAAAATATTAAAAAAATAATAGGAA 

t742 .nt ' 

AAATTAAATGACAAAAATCGAGAAATAATGCTAAACGAAGTTAAAAATAGCGTAATAGATCGAAACTATAAAAAAG 
CATATTCTGTTGCAAAACTTCTGCAAGACAAATACCCCCAAAATGAAGACATTGCAATGCTTACAAATACACTAGC 
AGAAATTGCCAACAGTAGTCCTTTTGAATCAAAAGACTTGCAAAGAGATTCTGCTAATCAAATCTTAGACAAGATC 
AAAGGTCAAGACAATACAAAAACAAATGTAAACGAAAATTTTGATATAGCATTTAATAATAGATACATTAAAGACA 
GCACAATAACAGAAAACTACTCTGACAGAAACGATGATGTTGGCATTGAAGATfeAAGACATATCTGAATTTAAAAA 
' AAGCAAAATCCCAGAAAAAATAAAACCAAATACAAACCCAAAAGAAGAAGACCAAATAATACAATC^ 
AAATTAAGTGTTAATGACCAAAAAAATTTATTTAATTTGGA^ 

GTGAAAATATTTTAAACGATTCTCAAAAAATAGAAAATGATAAGCAAAACACAAATTTATCCAAAGAAi^ 

GGAGAATATTTTAAAAACTCCGGACAACAGTAAATATTCAAACAATAACAATACTACATCTTTAAAAAAAATTTCT 

TCAAATTCCCAAAAAGAAAGTGAGCTTTCTCCACCCAGTCAAACAATAATAGGGAAAATTTATAGGCCATATAGCT 

ACTTGATAAAAAAAGAGCTCTATGAAATATTAGACGATATTAATACCGGAAGAGTCACACTTGGAAAAAACAGATT 

AAAAGAATTAATTAAAAAAGGTCTAAGCAACAAATTTCAAAAA 

GAAGCTTCAAATTTACTATTAACCTTAATAAAAAAAGATATTGAACCAAATCTCATTAATATACCAAAAGATCCTT 
ACAAAAAAGAAATTTTTCAATTAGATAAAGAAGACAAAAAGCCTCAGTACCTAGAGGACCTTAAATCTAAAGTTCA 
TTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGAATTCTTGAAA 
AACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAGGACCTTAAAT 
CTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGGATCTAAACGA 
ATTCTTGAAAAACAATCCCAATGACGCTCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACAACACCTAGAG 
GACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACAAGCCATTAAGG 
ATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTAGCTCAAGCTAATAAAATACA 
ACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAAGACAAAATCACGCCAACAA 
/GCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCAAGCTAAT 
AAAATACAACACCTGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCT^ 

CCAACAAGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCAGGCCTCTAAAACTTTAGCTCA 
AGCTAATAAAATACAACACCTAGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAi^ACACA 
AAATCACGCCAACAAGGCATTAAGGATCTAAACGAATTCTTAAAAAACAATCCCAATGACGCCCAGGCCTCTAAAA 
CTTTAGCTCAAGCTAATAAAATAC 

AACACCTGGAGGACCTTAAATCTAAGGTTCATTCAATAAAACCCATTGATCTTGAAAACACAAAATCACGCCAACA 

AGCCATTAAGGATCTAAACGAATTCTTAAAAACAATCCCAATGACGCCCAGGCCTCTAAAACTTTAGCTCAAGCTT 

ATGAAAACAATGGAGATTTGCTAAAAGCAGAAAATGCATACGAAAAAATTATCAAACTCACAAATACCCAAGAAGA 

TC ACTAT AAAC TTGGAATC ATT AGATTC AAGC TT AAAAAGTATGAAC AC TC AATAGAATC ATTTGATC AAAC AATA 

AAAC TCG AC C C AAAAC AT AAAAAAGC AC TTC AT AAC AAAGGAAT AGC TTT AATGATGC TAAAT AAAAAC AAAAAAG 

CAATAGAATCTTTTGAGAAAGCAATACAAATTGATAAAAATTATGGCACCGCCTACTACCAAAAAGGAATAGCAGA 

AGAAAAAAATGGCGATATGCAACAAGCATTTGCAAGCTTTAAAAATGCCTACAATCTCGACAAAAACCCCAA 

GCATTAAAAGCAGGAATAGTATCAAATAACTTGGGCAACTTCAAACAAAGTGAAGAGTATTTAAATTTTTTTAATG 

CC AATGC AAAAAAAC C T AACGAAATTGCTATTT ACAACCT ATC AAT AGC AAAATTTGAAAAC AAT AAACTTG AAG A 

ATCTCTOGAAACAATAAACAAAGCCATAGATTTAAATCCA 

AATCTTAAAAAAG AAAATTAC C AAAATGC T ATATCACTTT AC AGCTTAGT AATTG AAAAAAACCC TGAAAATAC TT 
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CAGGCTATATAAACCTGGCAAAAGCATATGAAAAATCAGGAAATAAAAGTCAAGCAATCTCAACTCTTGAAAAGAT 
AATAAAC AAAAATAATAAATTAGC C TTAAAC AATC TTGGGAT AC TTT AC AAAAAAGAAAAAAATTATC AAAAAGC A 
ATTGAAATTTTTGAAAAAGCAATAATCAATTCAGATATTGAAGCAAAATATAATCTTGCAACCACTCTAATTGAAA 
TTAATGATAACACAAGAGCTAAAGACCTTCTAAGAGAATATACAAAATTAAAACCAAACAATCCAGAGGCCTTACA 
.TGCACTAGGAATAATAGAATATAATGAAAATAACAATGATCAAACACTAAGAGAACTATAAAAAAATTTCCAAATT 
ACAAAAAAAATGAAAATATTAAAAAAATAATAGGAATATAA 

f743.aa 

MRIYLFLNKNYKIFILFLILILNSKLAYSQRLIRIGKEE 

NQLTEAEGALLDGIAVGGEIDYILYYELGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALLNRANAYVQQGKITS 
KEKEYQKAWDSYTMAIHDYSQFITLRSKTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIVYAKEDKNQILKD 
SFKDNLETNSLIELEKLNWQEELYIDE 

t743.aa 

YSQRLIRIGKEEMKNKNYIQAIETLSDAIKKYPKVQLGYYFLSXAYRENKQLTEAEGALLDGIAVGGEIDYILYYE 
LGNIMFNRGEGYYPLAIKYYSNSIKSRPNYDSALLNRA 

KTEKKDSILLIISYLRNEKINLEQLDKSLKGRTEHIVYAKEDi^QILKDSFKDNLETNSLIELEKLNWQEELYIDE 
f743.nt 

ATGAGGATTTATTTATTTTTAAATAAAAATTACAAGATTTTTATTTTATTTTTAATTTTAATATTAAATTCAAAAT 
TGGC 

ATATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTA 
AGTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATC 
AACTAACAGAAGCAGAAGGAGCATTGCTCGATGGAATTGCAGTAGGGGGTGAAATCGACTACATACTATATTATGA 
ATTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAA 
AGTAGACCTAATTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAG 
AAAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATC 
AAAAACAGAAAAAAAAGACAGCATTTTGCTTATAATAAGCTATTTAAGAAATGAAAA^ 

GACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAAAAATCAAATACTTAAAGATAGTT 
TTAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTATACATAGATGA 
ATAA 

t743 .nt 

TATTCTCAAAGGCTAATTAGAATTGGCAAAGAAGAGATGAAAAACAAAAATTACATTCAAGCAATCGAAACACTAA 

GTGATGCTATTAAAAAATATCCAAAAGTACAACTCGGCTATTACTTTTTATCAATAGCATACAGAGAAAATAATCA' 

ACTAAC AGAAGC AGAAGG AGC ATTGC TCGATGGAATTGC AGTAGGGGGTGAAATC GACT AC AT AC TAT ATT ATGAA 

TTAGGCAACATAATGTTTAACAGAGGGGAAGGTTACTATCCTTTAGCAATAAAATATTATTCTAATTCTATTAAAA 

GTAGACCTiV^TTATGACAGTGCGCTACTAAACAGAGCTAATGCCTATGTTCAACAGGGCAAAATAACTTCTAAAGA 

AAAAGAATACCAAAAAGCTTGGGACTCTTATACTATGGCTATCCACGACTACTCTCAATTTATTACCCTTAGATCA 

AAAAC AG AAAAAAAAG AC AGC ATTTTGC TT AT AAT AAGC T ATTT AAGAAATG AAAAAATT AATC TTG AAC AAC TTG 

ACAAAAGTTTGAAGGGGCGAACCGAGCATATTGTATACGCAAAAGAAGATAAAAATCAAATACTTAAAGATAGTTT 

TAAAGACAACCTAGAAACAAATTCTTTAATTGAGCTAGAAAAACTTAATTGGCAAGAGGAGTTATACATAGATGAA 

TAA 

f748.aa ' , ' 

mkfiinlllstikiitftvivcltilsifqpiyilkeneisittrlgkiqrtenlaglkykiplienvqifpkiil 

rwdgepqriptggeekqliwxdttarwkiadinkfyttiktmsrayvridaaiepavt^gvi-akyplleiirssndp 

iqrlsngiltpqetkingiykitkgrkiiekeiiriannntkdigieiv^ 

iaeeqrsigi^ekteilgsiekeklkilselakataakik^ 

krkifstdmdffqylhkrn 
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t7 48.aa 

I FQ P I Y I L XENE I S I TTRLGX I QRT ZNLAGLKYX I ? L I ENVQ I F ? KIILRWDGEPQRIPTGGEEKQLIWIDTTARW 
K I AD I NX F YTT I XCMS ?Al r v?. I DAA.T E ? A"/?.GVIA:>C/?LL2IIRSSND?IQRLSNGILTPQETKINGIYKITKGRK 

i i exe 1 1 ri ann:~xd ig i z i vd vx z ?_xvtvd ? s l i e s vnn? ;m 1 5 erqqiaeeqrsiglaekteilgsiekeklki 

L S EAKAT AAK I XAZ GDF.ZAA.X I "£ SNA. YGXN X E F YXF7/Q AL E S YKAVL XD KRX I F S T DMD F FQ YLHXRN 
f748.nt 

ATGAAATTT ATAATAAATC TTTTATTATC TAG TAT AAAGA.TT AT AAC C TTT AC AGT AAT AGTTTGCTTG AC TATTT 

TGTCTATTTTCCAGCCAA/TTTATATTTTGAAJIGAAA^ 

AACTGAAAJ^TTTAGCTGGACTTAJLATATA^ 

AGA.TGGG ATGGAGAAC CT C AAAG AATC C C AAC AGGAGGGGAAGAAAAGC AATT AAT ATGGATTGAT AC AAC TGCT A 
GATG^AAAATTGCA.GACATAAJVTAJLATTTTACACAAC^ ' 
AGC AATTG AAC C TGC TGTT AGGGGGG TTATTGCAAAAT AC C C TTTGC TTG AAATTAT AAGAAGCTC AAAC GATC CT 
ATTCAACGTTTGTCTAATGGAATACTCACCCCACAAGAAA 
GAAAGATAAJTCGAAAAAGA^TAATT^ 

AC T AATAA.GAAAAGTT AC TT A'TGACC C AAGC C TT ATTGAATC TGT AAAC AAC AG AA.TG ATC TC AGAAAGAC AAC AA 
ATCGCAGAAGAACAJ\AGAAGCAT\\GGATTAGCTGA^ 

AAAT ATT AAGTGAAGC AAAAGC C AC TGC TGC AAAAATAAAAGCC GAAGGGG AT AGAGAAGC CGCAAAAATTTATTC 
AAATGCATATGGCAAAAATATTGAA-rTTTACAAATTCTC^ 

AAAAGAAAAATTT TC T C AAC A.G AC A." GG AT TT C TTT C AAT AT C T T C AC AAAAG AAATTG A 

t748 .nt . . ' 

ATTTTCCAGCCAAT-TTATATTTTGAA 
AAAATTTAGCTGGA-CTTAAATATAJ^ATACCATC 

GGATGGAG AAC CTC AAAG AATC CC AAC A.GGA.GGGGAAGAAAAGC AATT AAT ATGGATTGAT AC AAC TGC T AGATGG 
AAAATTGCAGACATAJ^TAAATC^ 

TTGAACCTGCTGTTA.GGGGGGTTATTGCAAJLATAC^ 

AC GTTTGTC T AATGG AAT AC TC ACC C CA.C A.AG AAAC AAAAATT AAC GGT ATTT AT AAAAT AAC AAAAGGACGAAAG 
ATAATCGAAJ^J3AJiATAATTCGT^ 

TAAGAAAA.GTT AC C~T ATGAC C C AA.GC C TT 5 ATTGAATC TGT AAAC AAC AGAATG ATC TC AGAAAGAC AAC AAATC GC 
AGAAGAAC AAAGAAGC AT AGG ATT AGC TGAAAAAAC AGAAATTC TTGG AAGC ATAG AAAAAGAAAAACTGAAAAT A 
TTAAGTGAAGCAAAAGCCACTGCTGCAAAJ^^^ 

C AT ATGGC AA-A.AAT A.T TG AA_TTTT AC AAA.T T C TGGC AGGC ATT AG AAAGC T AT AAAG C AGT ATT AAAAG AT AAAAG 
AAAJ^ATTTTCTCAACAGACAJTGGATT^TCTTTCAATATCTTCACAAAAGAAATTGA . _ 

f764.aa 

MSG ? XXLA I 1 AI_ L VT S I QGC X E 3 3 1 1 E XQ r NY A 1 1 F S DAT E Y F F E I QTT P F I KNE I L F I ND XNLE I 1 KDKLXTTKK 
ILLTHXSNNEIL^TEILXEKIFYuSX^ 

TLINEXNISYIQTFITSQIXTIILF3L?_DNNIILXXILNS?FSKNIXFVLIGNTRXDLKIIKLKYIITLKEPDLIK 
IAXDVEXDFQYZFNIYXQ - 

f754.aa 

EKQFNYAIIFSDATZYFFEIQTCPFIXNEILFINDKN^ 

ixfslxj<sidfl^~<sidlqx~llfrd:<s£^^ 

lrdnniilxxilnspfsxnixf-.t.igntfxidlxiixlxyiitlxspdlixiaxdvexdfqyefniyxq 

f 754 . r.c * ' 

ATGTC TGGCCC T AAAAAAC TTGCT AT AATAGCGC TC TTAGT AATTTC AAT AC AAGG ATGC AAAG AATCTTCT ATT A 
TTGAAAAAC AATTT-AATT ATGC AJVT AATTTTTTC AG ATGC AAC TG AAJT ATTTTTTTG AAATTC AAAC AAC TCC ATT 
C AT AAAAAAC GAAAT AC T ATTT AT AAATG AC AAAAATTT AGAJ^ATTATAAAAGACAAGCTTAAAACAACAAAAAAA 
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ATACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAATTCTAAAAGAGAAAATTTTTTATCTATCAA 

AAATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACATTACT' 

ATTTAGAGACAAATGTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAAAATGTCAATATT 

ACTCTAATAAACGAAAAAAACATATCCTATATTCAAACATTCATTACTTCTCAAATAAAAACAATAATATTATTCT 

CTTTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAAT 

TGGCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAATATATAATCACCCTTAAAGAGCCTGATTTGATAAAA ■ 

ATAGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 

t764.nt 

GAAAAACAATTTAATTATGCAATAATTTTTTCAGATGCAACTGAATATTTTTTTGAAATTCAAACAACTCCATTCA 

TAAAAAJVCGAAATACTATTTATAAATGACAAAAATTTAGAAATTATAAAAGACAAGCTTAAAACAA 

ACTATTAACTCATAAATCAAATAATGAAATTCTAAATAACGAAAT 

ATAAAATTTTCTCTAAAAAAATCTATTGACTTTCTGCTTAACGAAAAATCAATAGATTTGCAAAAAACATTACTAT 

TTAGAGACAAATCTCTAAATAACGAAGACCTTGAATACTTGGAAAAAAAAGGCAAAGAAAAA^ 

TCTAATAAACGAAAAAAACATATCCTATATTCAAACAT 

TTAAGAGATAATAATATTATTTTAAAAAAGATACTAAATTCGCCTTTTTCTAAAAATATAAAATTTGTATTAATTG 
GCAATACAAGAAAAGACTTAAAAATTATTAAGCTAAAJVTAT^ 

AGCAAAAGATGTTGAAAAAGATTTTCAATATGAATTTAACATTTATAAACAATAA 
f770.aa 

MINFSKSFFYPLPIGKIFVLSGDMGSGKTSFLKGLALNLGISYFTSPTYNIVNVYDFINFKFYHIDLYRVSSLEEF 
ELVGGLEILMDLDSIIAIEWPQIALSIVPKDRLFSLTFKIVGSGRWELNG 

t770.aa 

KTSFLKGLALNLGISYFTSPTYNIVTSTVYDFINFKFYHIDLYRVSSLEEFELVGGLEILMDLDSIIAIEWPQIALSI 
VPKDRLFSLTFKIVGSGRWELNG 

f770;nt 

ATGATAAATTTTTCCAAATCTTTTTTTTATCCTTTGCCAATTGGTAAAATATTTGTTTTAAGTGGTGACATGGGAT 
CTGGAAiU^CTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACAT 
TGTTAATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGA.TTTATATCGGGTGTCTTCTTTGGAAGAATTT 
GAGCTTGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACAAATTGCTTTGA 
GCATTGTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGG 
TTAA " ■ , . 

t770.nt 

AAAACTAGTTTTTTAAAGGGACTTGCCCTTAACCTTGGAATTTCTTATTTTACAAGTCCAACTTATAACATTGTTA 
ATGTTTATGATTTTATAAATTTTAAATTTTATCATATTGATTTATATCGGGTGTCTTCTTTGGAAGAA.TTTGAGCT 
TGTTGGGGGATTGGAAATACTTATGGATCTTGACTCGATTATTGCTATTGAATGGCCACAAATTGCTTTGAGCATT 
GTTCCAAAAGATAGATTATTTTCTTTAACTTTTAAAATAGTAGGTTCAGGCAGGGTTGTAGAACTTAATGGTTAA 

f790.aa 

MNTKATTPLLLLFLIQSLAFSSEIFEFKYIKGSKFRLEGTDNQKIYFNGHYNSSSNTNIQISSEIKDIKENFASIK 
AFFRILKRENINEPYLLNEEFEEIFSVNKQGEYTIGANQKRPSVRGIPRFPKTPIKINEKWSYLAEEYIEASKIDK 
SIKDFV~\/TCFNV>JYEYKGKEEHNGKHYHIIL^ 

NQHFKMIGNSLGRIVSIELPNDNLIETEVENYIREKKIKAIEV^ 

AKLLEKFKKNNILIEGHTEQFGLEEEMHELSEKRARAIGNYLIKMKVXDKDQILFKGWGSQKPKYP 
RVEITILNN 



t790 . aa 
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SEIFEFKYIKGSKFRLEGTDNQKIYFNGHYNSSSNTNIQISSEIKDIKENFASIKAFFRILKRENINEPYLLNEEF 
EEIFSVNKQGEYTIGANQKRPSVRGIPRFPKTPIKI-NEKWSYLAEEYIEASKIDKSIKDFWKFNVNYEYKGKEEH 
NGKHYHIILSNYESQYNVKNISFYQKVDQKIYFDN^ 

DNLIETEVENYIREKKIKAIEVEKNNKGINLSFDIEFYPNSFQILQKEYKKIDLIAKLLEKFKiCNNILIEGKTEQF 
GLEEEMHELSEKRARAIGNYLIKMKVKDKDQIL^ 

■ f790..nt ' ' , 

ATGAATACCAAGGCGACTACACCATTGTTGTTATTATTTTTAATTCAAAGCTTAGCTTTTTCTTCTGAAATCTTTG 
AATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATTTCAATGGCCATTA 
TAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTTTGCAAGCATTAAA 
GCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTTGAAGAAATCTTCA 
GCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTATTCCAAGATTCCC 
AAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTCAAAAATAGACAAA 
AGTATAAAAGATTTCGTTGTAAAA.TTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCACAATGGCAAGCATT 
ACCACATAATTCTTTCGAATTATGAATCACAATACAATGTAAAAAACATCTCTTTCTATCAAAAAGTAGACCAAAA 
AATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAATAAATCAGAACAAC 
AACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAATAGTTTCAATTGAGCTTCCAAATGATAATCTTATTG 
AAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGC^ 

TTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAAAATTGACCTTATA 
GCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACTAATAGAAGGACATACTGAGCAATTTGGATTGGAAGAAG 
AGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGAAAGTAAAAGACAAAGA 
CCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAAGGCTAAAAATAGG 
CGAGTAGAAATTACAATATTAAATAACTAA 

. t790.nt 

TCTGAAATCTTTGAATTTAAATACATTAAAGGTTCAAAGTTTAGATTAGAAGGCACAGATAATCAAAAAATATATT 

TCAATGGCCATTATAATTCAAGCTCTAATACCAATATTCAAATTTCAAGTGAAATAAAAGACATAAAAGAAAACTT 

TGCAAGCATTAAAGCTTTTTTTAGAATCTTAAAAAGAGAAAATATTAATGAACCTTACCTATTAAATGAAGAGTTT 

GAAGAAATCTTCAGCGTAAATAAGCAAGGAGAATATACAATAGGAGCAAATCAAAAAAGACCTTCTGTTAGAGGTA 

TTCCAAGATTCCCAAAAACACCAATCAAAATAAATGAAAAATGGTCATATCTTGCAGAAGAATATATAGAAGCGTC 

AAAAATAGACAAAAGTATAAAAGATTTCGTTGTAAAATTTAATGTTAACTACGAATATAAAGGCAAAGAAGAGCAC 

AATGGCAAGCATTACCACATAATTCTTTCGAATTATGAJ^TCACAATACAATGTAAAAAACATCTCTTTCTATCAAA 

AAGTAGACCAAAAAATTTATTTTGATAATGAAATTGGCAATACATATAAATACAGCGATAAATATATATTTGAAAT 

AAATCAGAACAACAACCAACATTTTAAAATGATTGGAAACTCTCTTGGCAGAA'TAGTTTCAATTGAGCTTCCAAAT 

GATAATCTTATTGAAACTGAGGTTGAAAATTACATCCGAGAAAAAAAAATAAAAGCTATTGAAGTTG^^ 

ATAAAGGTATTAATTTAAGCTTTGACATTGAATTTTATCCTAACTCATTTCAAATACTACAAAAAGAATATAAAAA 

AATTGACCTTATAGCTAAACTTCTTGAAAAATTTAAAAAAAATAACATACTAATAGAAGGACATACTGAGCAATTT 

GGATTGGAAGAAGAGATGCACGAGCTATCTGAAAAAAGAGCTCGTGCAATTGGAAATTATTTAATAAAAATGAAAG 

TAAAAGACAAAGACCAAATACTATTTAAAGGATGGGGATCTCAAAAACCAAAATATCCTAAGTCCTCCCCATTAAA 

GGCTAAAAATAGGCGAGTAGAAATTACAATATTAAATAACTAA 

f792.aa * 

MKIFIYWWIFFFSVFKVFSIYSLTDEEFFKKYSLFFVHKGFLSKNWGKITKVQWGINSRWVYP 

SIYEDVYSSSSFLTTSNNLYVSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISF 

KNLNRLIPQIYLGAGYYDIISAIEFSKEETIOTLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKjWE 

KILVTITYDNKFYSYINGQWVFIGKLSLQDQDFFEKSQRMQLAKNKGSIYLTAYTLRNKKLAVDERFKF 

VIDFKDDNGNLTYSSKLSLPNKLRSVKNFIDVPYILKKAKELGIWIARCWFKDSKLYYYDNFKHALWNKKTNKP 

WAHLIKKVDSSGLVKYVQVT£HWVDIFSPATWEYN^ 

VTDALESFLIMAREQLYVPISVDIYGYNGWFPTNSIGQNISMLSDYV^ 

YKEGSDRALAFSLDGWIRPYVQAFLLGKERLVDDEIYLEYLKFQLTC 
LDSF 
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t792.aa 

IYSLTDEEFFKKYSLFFVHKGFLSKNVNGKITKVQVNGINSRWVYPFYKLVPSRITSIYEDVYSSSSFLTTSNNLY 
VSYDYSKNFRKLVGIDKFNSGAYITSSAFSQGDYKRIAIGTAIHGIYLSVNGAISFKNLNRLIPQIYLGAGYYDII 
SAIEFSKEETNNLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKNVEKILVRTYDNHFYSYINGQWV 
FIGKLSLQDQDFFEKSQRMQLAKNKGSIYLTAYTLRNKKAVTDERFKFIKDSGMNAVVIDFKDDNGNLTYSSKLSLP 
NKLRSVKNFIDVPYILKKAKELGIWIARCWFKDSKL^ 

HWVDIFSPATWEYNISIAKEIQSFGVDEIQFDYIRFPSDGPVSLAISRMNKYEMQPVDALESFLIMAREQLYVPIS 
VDIYGYNGWFPTNSIGQNISMLSDYVDVISPMFYPSHYTDDFLPSNFYYTKRAYRIYKEGSDRAJLAFSLDGWIRP 
' YVQAFLLGKERLVDDEIYLEYLKFQLKGIKESFGSGFSLWNASNVYYMIKGSLKEYLDSF 

f792.nf 

ATGAAAATTTTTATCTATTGGGTAGTTATTTTCTTCTTTTCTGTTTTCAAGGTTTTTAGTATATATTCATTAACCG 
ATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAAATGTTAATGGGAAAAT 
AACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGTTCCTAGTCGAATTACT 
TCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAACAATCTTTATGTTTCTTATGATTATT 
CAAAAAATTTTAGAAAATTAGTAGGAATTGATAAATTTAATAGCGGTGCATATATTACATCTAGTGCCTTTTCTCA 
AGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAATGGAGCTATTAGTTTT 
AAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATTAGTGCTATTGAATTTT 
.CAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTTAATTAGTCAGAAAAGTGG 
ATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATCTAGTAAGAATGTAGAA 
AAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGGTATTTATTGGAAAATTAT 
CTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTGCTAAAAATAAAGGGTCTATTTATTTAAC 
AGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTATTAAAGATTCAGGTATGAATGCTGTT 
GTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCAAGCTTTCTTTGCCCAATAAGTTGAGATCTG 
TTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTTGGAATTTATGTTATTGCTAGATGTGT 
TGTATTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACACGCCCTTTGGAATAAAAAAACCAATAAACCT 
TGGGCTCATTTGATTAAAAAAGTTGATTCTAGTGGTCTTGTGAAATATGTACAAGTAGAGCATTGGGTAGATATTT 
TTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTGGAGTTGACGAGATACAATT 
TGATTATATTAGATTTCCATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAAGTATGAGATGCAACCC 
GTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCTGTTGATATTTATGGGT 
ACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATGTTGACGTCATATCTCC 
TATGTTTTATCCTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAAAAGAGCTTATAGGATT 
TATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCTTATGTTCAAGCTTTTT 
TATTAGGAAAAGAAAGATTGGTGGATGACGAGATTTATTTGGAGTATTTAAAGTTTCAGCTTAAAGGAATTAAAGA" 
GTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGGTAGTTTAAAAGAATAT 
TTAGATTCTTTTTA 

t792.nt ~ 

ATATATTCATTAACCGATGAAGAATTTTTTAAAAAATATAGTTTATTTTTTGTTCATAAAGGATTTTTAAGTAAAA 
ATGTTAATGGGAAAATAACCAAAGTTCAAGTCAATGGGATAAATTCTAGGTGGGTTTACCCTTTTTATAAGCTTGT 
TCCTAGTCGAATTACTTCTATTTATGAGGATGTTTATTCTTCAAGTTCATTTTTGACTACAAGTAAGAATCTTTAT 
GTTTCTTATGATTATTCAAAAAATTTTAGAAAATTAGTAGGAATTGATAAJ\TTTAATAGCGGTGCATATATTACAT 
C'TAGTGCCTTTTCTCAAGGAGATTACAAGCGTATTGCTATTGGAACTGCGATTCATGGTATTTATCTTAGTGTTAA 
TGGAGCTATTAGTTTTAAAAATTTAAATCGTTTGATTCCGCAGATTTATTTAGGTGCAGGATATTACGATATTATT 
AGTGCTATTGAATTTTCAAAAGAAGAGACAAATAATTTATATTTTTCCTCTGGAGTTTATGGAGATATTTTTTTAA 
TTAGTCAGAAAAGTGGATTTATTAAAAAAATATCTTTTCCTTTCAAAAAGCAAATAATACGTATTTTAGACTTATC 
TAGTAAGAATGTAGAAAAAATTTTAGTCAGAACATATGACAATCATTTTTATTCTTATATTAATGGGCAATGGGTA 
TTTATTGGAAAATTATCTTTGCAGGATCAGGATTTTTTTGAAAAATCACAAAGGATGCAGCTTGCTAAAAATAAAG 
GGTCTATTTATTTAACAGCATATACATTGCGTAATAAGAAGGCAGTTGATGAAAGATTTAAATTTATTAAAGATTC 
AGGTATGAATGCTGTTGTAATTGATTTTAAAGATGATAATGGTAATTTGACTTATTCTAGCAAGCTTTCTTTGCCC 
AATAAGTTGAGATCTGTTAAAAACTTTATTGATGTTCCTTATATTCTTAAAAAAGCAAAAGAGCTTGGAATTTATG 
TTATTGCTAGATGTGTTGTA.TTTAAAGATTCAAAATTGTATTATTATGATAATTTTAAACACGCCCTTTGGAATAA 
AAAAACCAATAAACCTTGGGCTCATTTGATTAAAAAAGTTGATTC 
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CATTGGGTAGATATTTTTTCTCCTGCTACTTGGGAATATAATATTTCTATCGCAAAAGAAATTCAATCTTTTGGAG 
TTGACGAGATACAATTTGATTATATTAGATTTCGATCAGATGGGCCTGTGTCTCTTGCAATCTCAAGAATGAATAA 
GTATGAGATGCAACCCGTTGATGCACTTGAATCTTTTTTGATTATGGCAAGAGAACAGCTTTATGTTCCTATTTCT 
GTTGATATTTATGGGTACAATGGCTGGTTTCCTACTAATAGTATTGGGCAAAATATTTCAATGTTATCAGATTATG 
TTGACGTCATATCTCCTATGTTTTATCGTTCGCATTATACTGATGATTTTTTGCCAAGCAATTTTTATTACACAAA 
AAGAGCTTATAGGATTTATAAAGAGGGGAGTGATAGAGCACTTGCTTTTTCTTTAGATGGGGTTGTTATTAGGCCT 
TATGTTCAAGCTTTTTTATTAGGAAAAGAAAGATTGGTGGATGAGGAGATTTATTTGGAGTATTTAAAGTTTCAGC 
TTAAAGGAATTAAAGAGTCATTTGGTAGTGGCTTTAGCCTTTGGAATGCATCTAATGTTTATTATATGATTAAAGG 
TAGTTTAAAAGAATATTTAGATTCTTTTTAA 

f797.aa . - . 

MSIKKFILTLIILSLAKNSFSENEINIFENENYIV^ 

LEGISYSFINVFSKIGSSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTL 
RTESLSKTIAEYYKDNNWYYILAAITVENNINKETEXYEIRINPKIYiSTDFQKKLRLHFKSNQIKKFPIPIIE 

t797.aa 

KNSFSENEINIFENENYIVKENIKTEIKKLKQSFLLASVDVAISQPYIELADLNGEPIKELEGISYSFINVFSKIG 
SSAIISFDLSNEASKKYKIIKLEFLSPDKGNFINQLSSLTSGKQQSKKELAKDAYSFGTLRTESLSKTIAEYYKDN 
. NWYYILAAITVEMNINKETEKYEIRINPKIYNDFQKKLRLHFKSNQIKKFPIPIIE " 

f797. nt 

ATGAGCATTAAAAAATTTATTTTAACCTTGATAATTCTTTCTCTAGCTAAAAATAGCTTTTCTGAAAACGAAATTA 
ATATCTTCGAAAACGAAAATTATATTGTAAAAGAAAATATAAAAACAGAS^ 

ACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGCAGATTTAAATGGAGAACCGATAAAAGAA 
CTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAAAATTGGATCTTCTGCTATTATTTCATTTGACGTAT 
CAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAATTTTTAAGTCCAGATAAAGGCAATTTTATTAACCA 
GCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGAGCTTGCAAAAGACGCTTACTCATTTGGTACATTA 
AGAACTGAATCTCTTTCAAAAACAATTGCAGAATATTACAAAGATAACAACTGGTATTATATTTTAGCAGCAATAA 
C AGT AG AAAAT AAT ATAAAT AAAGAAAC TG AAAAAT AC G AAATT AGAATT AAC C C TAAAAT ATAT AATGATTTTC A 
AAAAAAATTGAGATTACATTTTAAAAGCAACCAAATAAAAAAATTTCCAATACCCATTATAGAATAA 

t797.nt 

AAAAAT AGCTTTTCTGAAAACGAAATT AAT ATCTTCGAAAACGAAAATT AT ATTGTAAAAG AAAAT AT A^ 

AAATTAAAAAACTAAAACAAAGTTTTTTACTTGCATCTGTTGATGTCGCCATTAGCCAACCCTACATAGAATTGGC 

AGATTTAAATGGAGAACCGATAAAAGAACTTGAAGGGATTAGTTATTCATTTATAAATGTATTTTCAAAAATTGGA 

TCTTGTGCTATTATTTCATTTGACCTATCAAACGAAGCTTCCAAGAAATACAAAATCATAAAATTAGAATTTTTAA 

GTCCAGATAAAGGCAATTTTATTAACCAGCTAAGCAGCCTTACTAGTGGAAAACAGCAATCAAAAAAAGAGCTTGC 

AAAAGACGCTTACTCATTTGGTACATTAAGAACTGAATCTCTTTCAAAJ^CAATTGCAGAATATTACAAAGATAAC 

AACTGGTATTATATTTTAGCAGCAATAJ^CAGTAGAAAATAATATAAATAAAGAAACTGAAAAATACGAAATTAG^ 

TTAACCCTAAAAT AT AT AATGATTTTC AAAAAAAATTGAGATTACATTTTAAAAGCAACCAAATAAA^ 

AAT AC CCATTATAGAAT AA 

f799.aa " • 

MKKHIIIGIIFVAILLFFKILLIPRIQNHENNKNNII^ 

IESNLLYIS3KNFTTYANIIYQNESLLSII-LKSNGMNNVFYSKRIKPRGKI 

t799.aa ^ 

KENNKNNIKMIISYKQDKNRLSLKINIK^ 
IILKSNGNNNVFYSKRIKPRGKI 
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f799.nt 

ATG AAAAAAC AT ATC ATT AJTTGGG AT AA-T C TTT G TTGC AAT T C T TTT ATT TTTT AAAATTT TAT T AATT C C C AG AA 
TT C AAAATC AC G AAAATAAT AAAAA.T AAT A.TC AAAA.IG AT AA.T AA.GC TAG AAGC AAG AC AAAAAC AG AT TAT C GC T 
AAAGA.TAAACATAAAAACAA^AAAAA.CTA.CCAA 

ATT G AAAGC AATTTGC TT T AT AT AA.GC A.GC AAAAAG TT T AC A- C AT ATGC T AAT AT AATC T AT C AAAATG AAAG TT 
T ATT AAG T AT AAT AJTT AAA.G AG T AATGOC AA.I AAT AATGT C T T T TA.TAGTAAAAG AAT AAAAC C T AG AGGT AAAAT 
ATGA ' . * . , 

t799.nt 

C AC G AAJ^T AAT AAAAA.T AAjT ATC AAAwATG AT AA_T AA.GC T AC AAGC AA.G A.C AAAAAC AG AT TAT C GC T AAAG AT AA 
AC AT AAAAAC AAAAAAAAC T AC C AAC CTGGG AAAAGC C AAAC T AGA.T A.TTT ATC T AGAC AGTAAATT AATTGAAAG 
C AAT TTG C T TT AT AT AA.GC AGC AAAAA.C TTT AC AA.C AT ATGC T AAT AT AA.TC T AT C AAAAT G AAAGTTT ATT AAGT 
ATAATATTAAAGAGT>_ATGGCAA_TAATAAjTG'~CTTTTAT 

f800 .aa 

MKKHYKAL I LS LLF AI Z 507TKTLNZLGEEQFKI ?FGTL?GAIM?I^^n<^TNS KFD I KT YMGLVY IAS I KTNKLMI 
FNSYGKLIQTYQNGIFKTNPDLKIKKIDEEGIQA.IYPLKDFI^^ 

E I LGQ EG LNGMP F PQ I YDVNVD ENGNI A_J I S I Y 3 EG Y 1 1 Y S YNXE r S ? L YK X YVNKNL L KT I DNQ KKK YN I S I DKV 
FFEVNKKTLYVKTTYYZNIGDNZNI>^LGIXIXDQYIY^^ 

D K 1 I AS TNMKNL SNNL E WKLDS KG S I KZQ I.AI IE? PITuMF L S E 3 L 3 KEG I L S I L YGGKTG VSVYWWNLNAL L KL 
t800 .aa 

KTLNELGEEQFKIPFGCLPGAIM?LN>n<IFTN3XF^ 

LKIKKIDF EG I Q A I Y ? LKD F 1 1 VAD KL>I2TKX3 XFNQXZN I AYF^ER I L I LNXNS S VE I LGQ EG LNGMP F PQ I YD VNV 

DENGNIAIISIYSEGYIIYSYNXSFSPLYXI^^^ 

NENINDLGIKIKDQYITXMSLKXNKEL^ 

SKGSI KEQIAL I E P PNI-MFL5ESL S XDGZL S I L YGGKTG VS VY/frDTLNALLKL . 
f800.nf 

ATGAAAAAACACTAT^AAGCTCTTATAJT^ 

AATT AGGAGAAG AAC AATTT AAAAT AC C ATTTGGAAC AC TTCCTGGTGC AAT AATGC C TC TG AAT AAC AAATTT AC 

AAATTCAAAATTTGACA.TCAAAA.CGTATAACGGGCTAGTGTA.CA.TTGCA 

TTCAACTCATACGGAAAACTAATACAAA-CAJTATCAA^ 

AAATAGATTTTGAAGGAJ^TTC AA.GC A\TAjTA.CCCAXTAAAAGATTTTATTA.TTGTCGC AGAC 

AAAATCAAAATTCA^CCAAAAAGAGAATAJTTGCCTA.CTTCATGAGA 

GAAATTTTGGGTCA^GAAJSGTTTAAACGOAAIG^ 

AC ATT GC AAT AAT ATC AAT AT AT AGC G A_AGG AT A.T AT AAT AT AT T C TT AC AAT AAAG AAT T TT C C C C GC T TT AT AA 
AATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATC 

TTTTTTG AAGTC AAC AAAAAAACTC TTTA-TGT AA.AAACT AC TT AC T ATGAAAAC ATTGGTG AC AATGAAAATAT AA 
AC GAT C T TGG AAT TAAAATT AAAG ATC AAT A-T ATG TAT AAAATG A.GTTTG AAAAAAAAC AAAG AAT T AG AAGTG AT 
AAAT AAAATTGCTCTTCC T AAAAAC TTA:C TA_GA.TGATAAACAAGAAA.GC TTTATAAAC ATT AT AAAAAT AC AAAAA 
G AC AAAAT AAT AGC ATC T AC T AAT ATGAAAAATTT AT C T AAC AATTT AAT ATGG AAATT AG AC AGC AAGGGC TC AA. 
TTAAAGAAC AAAT AGCTTT AATTG AGCC TC C AAATTT AATGTTTC TC TC TGAGAGTTT ATC T AAAGATGG AAT ACT 
.TAGTATACTTTATGGCGGAAAAACTGGTGTTAGT^ 

t800.nt . 

AAAAC T TT AAAC G AATT A.GG AG AAG AACAA.TTT AAAA.T AC C AT TTGG AAC AC T TC C TGGTGC AAT AATGC C T C TG A 
ATAACAAATTTACAAATTCAAAATTTGA.CATCAAAACGTATAACGGGCTA 

T AAAT TAATGATTTT C AA.C T C AT AC GC- .-AAA. C T AAT AC AAA.C AT AT C AAAATGG AAT AT TT AAAAC AAA.C C C CG AT 
TTA^AAA.TAAAAAAAATAGATTTTGAAG-G AATT C AA.GC AAT AT AC C C AC T AAAAG ATTTT ATT ATTGTCGC AGAC A 
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AACTAAATAATAAAAAATCAAAATT 

AAACTCATCTGTAGAAATTTTGGGTCAAGAAGGTTTAAACGGAATGCCATTTCCACAAATTTATGATGTTAATGTT 

GATGAAAATGGCAACATTGCAATAATATCAATATATAGCGAAGGATATATAATATATTCTTACAATAAAGAATTTT 

CCCCGCTTTATAAAATTTACGTCAACAAAAACCTGTTAAAAACAATAGACAATCAAAAGAAAAAATACAACATTTC 

AATAGATAAGGTTTTTTTTGAAGTCAACAAAAAAACTCTTTATGTAAAAACTACTTACTATGAAAACATTGGTGAC 

AATGAAAATATAAACGATCTTGGAATTAJ^AATTAAAGATCAATATATCTATAAAATGAGTTTGAAAAAAAA 

AATTAGAAGTGATAAATAAAATTGCTCTTCCTAAAAACTTACTAGATGATAAACAAGAAAGCTTTATAAACATTAT 

AAAAATACAAAAAGACAAAATAATAGCATCTACTAATATGAAAAATTTATCTAACAATTTAATATGGAAATTAGAC 

AGCAAGGGCTCAATTAAAGAACAAATAGCTTTAATTGAGCCTCCAAATTTAATGTTTCTCTCTGAGA.GTTTATCTA 

AAGATGGAATACTTAGTATACTTTATGGCGGAAAAACTGGTGTTAGTGTTTACTGGTGGAATTTAAATGCATTATT 

AAAATTATAA ' 

f 810 .aa * ' 

MYKLFLFFIIFMFLSCDEKKSSKNLKSVKIGYVNWGG 

GTVSSWPTADKFYYEKLKTKFVDLGANYEGTIQGFWPSWPISSISELKGKGDKFKNKMIGIDAGAGTQIVTEQ. 
ALNYYGLSKEYELVPSSESVMLASLDSSIKRNEWILVPL^ 

NDDFDAYYVFDHFYWSDDLILPLMDKNDKEPGKEYRNAVEFVEK^JKEIVKTWVPEKYKTLFD 
t810.aa 

CDERKSS KNLKSVKIGYVNWGGETAATNVLKVV^ 
EKLKTKFVDLGANYEGTIQGFWPSWPISSISELKGKG^ 

SSESWLASLDSSIKRNEWILVPLWKPHWAFSRYDIKFLDDPDLIMGGIESVHTLVRLGLENDDFDAYYVFDHFYW 
SDDLILPLMDKNDKEPGKEYRNAVEFVEKNKEIVKTWVPEKYKTLFD 

f810.nt 

ATGTATAAATTATTTTTATTTTTTATTATTTTTATGTTTTTGTCTTGTGATGAAAAAAAGAGTTCAAAGAATTTAA 

AATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAGCTACAAATGTATTAAAGGTTGTTTTTGAGAA 

AATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTATAATGTATCAATACTTAGCATCTGGAAAGATAGAC 

GGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTATGAAAAACTGAAAACAAAGTTTGTTGATCTTG 

GTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCTATGTTCCAATTTCCAGCATTAGTGAGCTTAA 

GGGTAAAGGTGATAAGTTTAAAAACAAAATGATTGGCATAGA.TGCTGGTGCGGGAACTCAAATTGTTACAGAACAA 

GCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCTTCAAGTGAGAGTGTTATGCTTGCAAGTTTAG 

ATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTTGTGGAAGCCTCATTGGGCTTTTTCTAGGTATGATAT 

TAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGAGCGTGCATACTCTTGTTAGACTTGGTCTTGAA 

AATGATGATTTTGATGCATATTATGTTTTTGATCATTTTTATTGGAGCGATGATTTAATATTGCCCTTAATGGATA 

AAAA/TGATAAAGAGCCAGGCAAAGAATACCGCAATGCGGTTGAATTTGT^ 

GTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 

t810.nt ' 

TGTGATGAAAAAAAGAGTTCAAAGAATTTAAAATCGGTAAAAATTGGATATGTGAATTGGGGTGGAGAAACGGCAG 
CTACAAATGTATTAAAGGTTGTTTTTGAGAAAATGGGCTACAATGCAGAAATATTTTCAGTTACTACGTCTA.TAAT 
GTATCAATACTTAGCATCTGGAAAGATAGACGGTACGGTGTCTTCTTGGGTTCCTACAGCCGATAAATTTTATTAT 
GAAAAACTGAAAACAAAGTTTGTTGATCTTGGTGCAAATTATGAAGGAACCATTCAAGGTTTTGTGGTGCCAAGCT 
ATGTTCCAATTTCCAGCATTAGTGAGCTTAAGGGTAAAGGTGA.TAAGTTTAAAAACAAAATGATTGGCA.TAGATGC 
TGGTGCGGGAACTCAAATTGTTACAGAACAAGCGCTTAATTATTATGGATTAAGTAAAGAGTATGAGCTAGTTCCT 
TCAAGTGAGAGTGTTATGCTTGCAAGTTTAGATTCTTCAATAAAGAGAAACGAATGGATTTTAGTTCCTTTGTGGA 
AGCCTCATTGGGCTTTTTCTAGGTATGATATTAAGTTTCTTGATGATCCTGATTTAATTATGGGGGGAATTGAGA.G 
CGTGCATACTCTTGTTAGACTTGGTCTTGAAAATGATGATTTTGATGCATATTATGTTTTTGATCA.TTTTTA.TTGG 
AGCGATGATTTAATATTGCCCTTAATGGATAAAAATGATAAAGAGCCAGGCAJUVGAATACCGCAATGCGGTTGAAT 
TTGTTGAAAAGAATAAAGAGATTGTAAAGACGTGGGTTCCAGAAAAATATAAGACCTTATTTGATTAA 



f 814 .aa 
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MLVKRIVGKPITMLILFSLLLMISLYTFSRLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNL 
KNIYSVSSKESSTVSLEFYHGTDLDLVLNEIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELK 
RYADEIIKPGLERLDGVAIVTVNGGSKKRVLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNLEYLVEV 
■SGKFKSIEEIGNWIAYKIPDISSGINLSPIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVS 
NVV^EIEKLKLSMPKDMKLEIASDSTDFIKASISTVVNSAYFGAMLAIFVIFFFLRSFRATIIIGISIPIAIVLT 
FCLMYFVNISLNIMS LAG LALG IGMVVDC S I VVIDNIYKYRQKGAKL I SSS I LGAQEMMLPITSSTFTSICVFGPF 
LIFKSELGVYGDFFKDFTFTIVISLGVSLLVAIFLVPVtjSSHYVGLYTSFQKNIKNAFIRKIDAFFASIYYFLEFL 
YINLLNIVLNHKLIFGLIVFFSFIGSLLLGLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEA 
KGYKS I IATLRADRITFNVLF PLKEESRDNLTQSVDYDS IKYKIMNRIGNLYPEFNIEPS I SGNALGGGDS IKIKI 
SANDFEYIKDYGKILVSMLKKEIPELVNPRLSISDFQLQ 

EKGLNYDIVLKLDRMDVKNLKDLEKIFITNSSGVKIPFSSIATFEKTNKAESIYRENQALTIYLNAGISPDDNLTQ 
VTAKWDFINNKVPHKEGITLKVEGEYN^ 

VLIHFLAGEKLSIFAAIGMLMLVGVVVNTGIVLVDYTGLLIKRGFGLREAIIESCRSRLRPILMSSLTSIIGLIPM 
AFSSGSGNELLKPIAFTFIGGMTASTFLTLFFIPMLFEIFPTCFKFQI 

t814.aa 

RLKVDLLPGIDIPQISIHTVYPGASPREVEESVSRVLESGLSSVKNLKNIYSVSSKESSTVSLEFYHGTDLDLVLN 
EIRDALELVKSSLPSKSQTPRIFRYNLKNIPVMEIVINSVRPVSELKRYADEIIKPGLERLDGVAIVTWGGSKKR 
VLIEVSQNRLESYGLSLSRISSIIASQNLELSAGNILENNLEYLVEVSGKFKSIEEIGNWIAYKIPDISSGINLS 
' PIEIKLKDIANIKTDFEDLSEYVEYNGLPSISLSVQKRSDSNSIAVSNVVMNEIEKLKLSMPKDMKLEIASDSTDF 
IKASISTVVNSAYFGAMLAIFVIFFFLRSFRATIIIGISIPIAIVLTFCLMYFVNISLNIMSLAGLALGIGMVVDC 
SIWIDNIYKYRQKGAKLISSSILGAQEMMLPITSSTFTSICVFGPFLIFKSELGVYGDFFKDFTFTIVISLGVSL 
LVAIFLVPVLSSHYVGLYTSFQKNrKNAFIRKIDAFFASIYYFLEFLYINLLNIVLNHKLIFGLIVFFSFIGSLLL 
GLLLDVTTFTRGKENSITINLNFPHKTNLEYAKFYSNRFLEIVKSEAKGYKSIIATLRADRITFNVLFPLKEESRD 
NLTQSVDYDSIKYKIMNRIGNLYPEFNIEPSISGNALGGGDSIKIKISANDFEYIKDYGKILVSMLKKEIPELVNP" 
RLSISDFQLQIGVEIDRALVYNYGIDMNTIL^ 

NSSGVKIPFSSIATFEKTNKAESIYRENQALTIYLNAGISPDDNLTQVTAKVVDFINNKVPHKEGITLKVEGEYNE 
FSNIMNQFKIIIMMAI IVVFGIMASQFESFLKPFI IIFTIPLTAIGVVLIHFLAGEKLSIFAAIGMLMLVGVVVNT 
GIVLVDYTGLLIKRGFGLREAIIESCRSRLRPILMSSLTSIIGLIPMAFSSGSGNELLKPIAFTFIGGMTASTFLT 
LFFIPMLFEIFPTCFKFQI 

f814.nt 

ATGTTGGTAAAGAGAATAGTTGGCAAACCAATAACAATGTTGATTTTATTTTCATTGTTATTGATGATAAGTTTGT 
ATACCTTTTCAAGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCC 
TGGCGCTTCTCCTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTA 
AAAAATATATATAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATCATGGAACCGATTTAGATT 
TGGTTTTAAATGAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAAT 
TTTTAGATACAATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAA 
AGATATGCCGATGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAA 
GTAAAAAGCGTGTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTT.GTCAAGAATATCTTC 
AATTATAGCATCCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTT.GAAGTT 
TCTGGAAAATTTAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCA 
TAAATTTATCTCCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATA 
TGTTGAATATAATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCT 
AATGTTGTTATGAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATA 
GTACTGATTTTATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGT 
TATTTTTTTCTTTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGACC 
TTTTGTTTAATGTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTTGGGATTGGAATGG 
TTGTTGACTGTTCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATTTCGTCTTC 
TATTCTCGGAGCTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTT 
CTTATTTTCAJKATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTTATTTCCTTGG 
GTGTTTCTCTTTTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATGTCGGTTTATACACAAGTTTCCA 
AAAGAATATTAAGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATTATTTTTTAGAGTTTTTG 
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TATATCAATTTATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTTT,TATTGGCA 

GCTTGCTTTTAGGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAA 

TTTTCCCCACAAAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGGCT 

AAAGGATATAAAAGTATTATTGCTACTTTGCGTGCTGATAGA.ATAACTTTCAACGTATTGTTTCCTCTCAAAGAAG 

AATCAAGAGATAA.TTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCT 

TTATCCTGAATTTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAATTAAAATT 

TCGGCCAATGATTTTGAATATATAJVAAGATTATGGAAAAJ^TTTTAGTTTCCATGTTAAAAAAGGAAA 

TTGTAAATCCAAGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGATAGACAGAGCGCTAGTTTATAA 

TTATGGTATTGACATGAATACCATTTTAAATGAGTTGAAGGCCAATATTAATGGTGTTGTTGCTGGGCAATATGTG 

GAGAAGGGACTTAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTTAAAAGATTTAGAAAAAA 

TATTTATTACAAATTCATCTGGAGTTAAAATTCCTTTTTCATCAA.TAGCCACCTTTGAAAAAACCAATAAAGCCGA 

ATCTATTTACAGAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTCCAGATGATAATTTAACCCAA 

GTAACCGCAAAAGTTGTAGATTTTATTAATAATAAGGTGCCCCATAAAGAAGGCATAACTCTTAAGGTTGAAGGAG 

.AATATAATGAATTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTAT 
TATGGCTTCTCAATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTT 
GTGCTTATACATTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGTTGGTGTTG 
TGGTAAATACAGGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAAT 
- TATTGAATCTTGTCGTTGAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATG 

■ GCATTTTCTAGCGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGAATGACAGCTA.GCA 
CATTTCTTACTTTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

t314'.nt 

AGATTAAAAGTAGATCTTTTGCCGGGAATTGACATTCCCCAAATAAGTATTCACACTGTTTATCCTGGCGCTTCTC 
CTAGAGAAGTTGAAGAGAGTGTTTCTAGAGTCCTTGAGAGTGGCTTGAGTTCGGTAAAGAATTTAAAAAATATATA 
TAGTGTATCTTCCAAAGAAAGCAGCACCGTTTCACTTGAATTTTATCATGGAACCGATTTAGATTTGGTTTTAAAT 
GAAATTCGAGATGCTCTTGAATTGGTAAAATCTTCATTGCCCAGCAAATCACAGACCCCAAGAATT.TTTAGATACA 
ATCTTAAAAACATCCCTGTAATGGAAATTGTTATTAATTCTGTAAGGCCAGTTTCTGAGCTTAAAAGATATGCCGA 
TGAAATCATTAAACCTGGGCTTGAAAGGCTTGATGGAGTTGCAATTGTTACTGTTAATGGTGGAAGTAAAAAGCGT 
GTTTTAATTGAAGTTTCTCAAAACAGGCTGGAGTCTTATGGGCTTTCTTTGTCAAGAATATCTTCAATTATAGCAT 
CCCAAAATTTGGAACTTTCAGCTGGCAATATATTGGAGAACAACTTGGAATATTTGGTTGAAGTTTCTGGAAAATT 
TAAATCAATTGAAGAGATAGGTAATGTGGTCATAGCTTATAAGATACCCGACATTTCTTCTGGCATAAATTTATCT 
CCTATTGAGATAAAACTCAAAGATATTGCTAATATTAAAACCGATTTTGAAGATTTGTCTGAATATGTTGAATATA 
ATGGGTTGCCTTCAATTTCTTTGTCGGTTCAAAAACGTAGTGATTCTAATTCTATTGCAGTTTCTAATGTTGTTAT 
GAATGAAATAGAAAAATTGAAATTATCTATGCCTAAAGATATGAAATTGGAGATTGCTTCTGATAGTACTGATTTT 
ATTAAAGCATCCATTTCAACGGTTGTAAATTCAGCCTATTTTGGGGCCATGCTTGCAATATTTGTTATTTTTTTCT 
TTTTAAGAAGCTTTAGGGCCACAATAATTATTGGAATTTCTATTCCAATAGCAATTGTTTTGACCTTTTGTTTAAT 
GTATTTTGTAAATATTTCTCTTAATATTATGAGTCTTGCGGGTCTTGCACTTGGGATTGGAATGGTTGTTGACTGT 
TCAATTGTTGTAATAGACAATATATACAAATATAGGCAAAAAGGAGCAAAGCTTATTTCGTCTTCTATTCTCGGAG 
CTCAGGAGATGATGTTGCCTATTACATCTTCAACTTTTACTTCTATTTGTGTTTTTGGTCCATTTCTTATTTTCAA 
ATCAGAACTTGGGGTATATGGAGATTTTTTCAAAGACTTTACATTTACGATTGTTATTTCCTTGGGTGTTTCTCTT 
TTAGTTGCAATTTTTTTGGTTCCTGTTTTATCAAGCCACTATGTCGGTTTATACACAAGTTTCCAAAAGAATATTA 
AGAATGCTTTTATTAGGAAAATCGATGCCTTTTTTGCTAGTATTTATTATTTTTTAGAGTTTTTGTATATCAATTT 
ATTAAATATAGTTTTAAATCACAAATTGATTTTTGGGTTGATTGTTTTTTTTAGTTTTATTGGCAGCTTGCTTTTA 
GGATTATTGTTAGATGTGACAACTTTTACTAGAGGGAAAGAGAACTCAATTACTATTAATTTAAATTTTCCCCACA 
AAACTAATTTGGAATATGCAAAATTTTATTCTAATAGATTTTTAGAAATTGTAAAAAGTGAGGCTAAAGGATATAA 
AAGTATTATTGCTACTTTGCGTGCTGATAGAATAACTTTCAACGTATTGTTTCCTCTCAAAGAAGAATCAAGAGAT 
AATTTAACCCAAAGCGTAGATTACGATTCTATTAAATATAAAATTATGAATCGTATTGGTAATCTTTATCCTGAAT 
,TTAATATTGAGCCTTCCATTAGTGGCAATGCTTTAGGTGGTGGAGATTCTATTAAAATTAAAATTTCGGCCAATGA 
TTTTGAATATATAAAAGATTATGGAAAAATTTTAGTTTCCATGTTAJ^AAAAGGAAATTCCCGAACTTGTAAATCCA 
AGGCTTAGCATAAGTGATTTTCAGCTTCAAATTGGCGTTGAGATAGACAGAGCGCTAGTTTATAATTATGGTATTG 
ACATGAATA.CCATTTTAAATGAGTTGAAGGCCAATATTAATGGTGTTGTTGCTGGGCAATATGTGGAGAAGGGACT 
TAATTATGATATTGTTCTTAAGCTTGATAGAATGGATGTTAAAAATTTAAAAGATTTAGAAAAAATATTTATTACA 
AATTCATCTGGAGTTAAAATTCCTTTTTCATCAATAGCCACCTTTGAAAAAACCAATAAAGCCGAATCTATTTACA 
GAGAAAATCAAGCTTTAACCATTTATCTTAATGCGGGTATTTCTCCAGATGATAATTTAACCCAAGTAACCGCAAA 
AGTTGTAGATTTTATTAA.TAATAAGGTGCCCCATAAJVGAAGGCATAACTCTTAAGGTTGAAGGAGAATATAATGAA 
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TTTTCAAATATCATGAATCAGTTTAAAATAATCATTATGATGGCTATTATTGTTGTGTTTGGTATTATGGCTTCTC 
AATTTGAATCTTTTTTAAAACCCTTTATTATTATTTTTACAATTCCTTTAACGGCAATAGGGGTTGTGCTTATACA 
TTTTCTTGCAGGAGAAAAGCTTTCTATTTTTGCTGCAATTGGTATGCTTATGCTTGTTGGTGTTGTGGTAAATACA 
GGAATTGTTCTTGTAGACTATACTGGTTTATTGATCAAGAGGGGATTTGGCCTAAGAGAAGCAATTATTGAATCTT 
GTCGTTCAAGGCTTAGGCCAATTTTAATGTCTTCTTTGACCTCAATAATAGGGCTTATTCCAATGGCATTTTCTAG 
CGGAAGTGGAAATGAACTTCTAAAACCAATTGCATTTACTTTTATTGGCGGAATGACAGCTAGCACATTTCTTACT 
TTGTTTTTTATTCCCATGCTTTTTGAAATTTTTCCAACATGTTTCAAGTTTCAAATCTAG 

f818.aa 

MLKNHSKLIIQLKWMMIYLKKMGN^^ 

GEKHGNGVWPEENFILI IYTSNQSIVERLKDIVDDLNRSYPTEG INLFVLRNS 
t818.aa 

KKMGNDMTKFYNYRIEIVSNLSLELDVFECI^ 

SNQSIVERLKDIVDDLNRSYPTEGINLFVLRNS ' . 
f818.nt 

ATGTTGAAAAATCATTCAAAATTAATAATTCAACTAAAAGTAGTTATGATGATTTATTTGAAGAAGATGGGGAATG 
ATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTGATGTTTTTGAATGTAT 
AGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTATGGAAAAGGTAAGAAG 
GGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACCTCCAATCAGTCTATTG 
TTGAGCGATTAAAGGATATTGTGGATGATTTGAATGGTTCTTACCCTACAGAAGGGATTAATCTTTTTGTTTTGAG 
AAATTCTTAA * ' ' ■ 

t818.nt 

AAGAAGATGGGGAATGATATGACTAAATTTTATAATTATAGGATTGAAATAGTTTCTAACTTATCTTTAGAGCTTG 
ATGTTTTTGAATGTATAGAAAAAATAGAGCAAGAGTTAGGAGAGTCTATATATTATTCTAAGATAGGAAATGTTTA 
TGGAAAAGGTAAGAAGGGAGAAAAGCATGGTAATGGCGTTTGGCCTGAAGAAAATTTTATTTTGATTATTTATACC 
TCCAATCAGTCTATTGTTGAGCGATTAAAGGATATTGTGGATGATTTGAATCGTTCTTACCCTACAGAAGGGATTA 
ATCTTTTTGTTTTGAGAAATTCTTAA 

f 820 .aa . v - . ■ . 

MLNNTYRIKTILTIFLAITLLTIYKYFTLMAFNNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQK 
IENIVSTSETLGAILQINSRILKEKLSSNKGFLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRJSTTTS 
NITGFVGTDNLGLEGIEFSLNSILGKDKTKQQFLNE^ 

NSQNGEILSMVQFPQYDANFYSKYPEEIRKNLSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPS 
GEKITIKTLNPPYKHIDSTEILIYSS1WGIAYITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLLNHYSKWSGR 
SKATIGFGQEIGVSAVQILQAASILSIWGIMLKPRIIKKI^ 

NKGGIPNLKIKj^LDISAKS'GTSQAIDRKTGKYSEEDYTSSII^IYPTEQPKYIIYIVYRYPKKIIYGTRIAAPMAK 
EI IEFIEHQQNTIAYKKIKMPSKIKIPKAETNYKNKTYLPNFINLSKREAIDILKYYKNTMKIKINGDGFVYKQSI 
SPNTKLEDITELELYLK 

t820.aa , 

FNNSPDNTISLKSNDIAKRGTIYDRNGKPIAFSSKSYSIGTNPQKIENIVSTSETLGAILQINSRILKEKLSSNKG 

FLYIKRKIKREESDLIKRIQAEGRLSNITLYPDYTRIYPFRNTTSNITGFVGTDNLGLEGIEFSLNSILGKDKTKQ 

QFLNEEPETNNIHLTIDMDIQQGVSKIAKKYFKENNPESLITLVMNSQNGEILSMVQFPQYDANFYSKYPEEIRKN 

LSSSLTYEPGSINKIFTVAIILESGKLNLEEKFLDNGIYQKQFPSGEKITIKTLNPPYKHIDSTEILIYSSN^/GIA 

YITEKVSNEYFYKKLLDFGFGEKVGVPFPGETKGLLNHYSKW 

LKPRIIKKISNDKGENIKEFDKEEIRKVISKNSAQKVLKMMREVV^ 
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YSEEDYTSSILAIYPTEQPKYIIYIVYRYPKKIIYGTRIAAPMAKEIIEFIEHQQNTIAYKKIKMPSKIKIPKAET 
•NYKNKTYLPNFINLSKREAIDILKYYKNTMKIKINGDGFVYKQSISPNTKLEDITETLELYLK 

f820.nt 

ATGCTTAATAACACTTATCGAATAAAAACAATATTAACAATATTCTTGGCTATAACTTTGTTAACTATTTACAAAT 
ATTTCACACTAATGGCCTTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGG 
AACAATTTATGATAGAAATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAA 
ATAGAAAATATTGTAAGCACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGC 
TTTCCTCTAACAAAGGGTTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAA 

AGCTGAAGGCAGGCTTTCAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGC 
AATATTACTGGTTTTGTAGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAA 
AAGATAAAACCAAGCAACAATTTTTiy^TGAGGAGCCAGAAACAAACAACATCCACTTAAC 
ACAACAAGGTGTTAGCAAAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAAT 

AACTCCCAAAATGGAGAAATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTG 
AAGAAATCCGAAAAAACCTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATA^ 

AATATTAGAAAGTGGAAAATTAAATTTAGAAGAAAAATTTTTAGACAATGGAATATATCAAAAACAATTTCCATCA. 
GGAGAAAAAATTACAATCAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCAT 
CAAATGTTGGAATAGCTTACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTT 
TGGGGAAAAAGTTGGAGTTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAATGGTCAGGACGA 
AGTAAAGCTACAATTGGATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAA 
GCAATAATGGAATAATGCTAAAACCTAGAATAATAAAAAAAATAAGCAACGATAi^G 

TGATAAAGAAGAAATAAGAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGATGAGAGAAGTTGTA • 
AATAAAGGTGGAATTCCAAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACAT ' 
ATAGAAAAACGGGAAAATACTCAGAAGAAGACTATACATCTTCTATATTGGCAATATACCCCACAGAACAACCAAA 
ATATATTATTTACATTGTATACAGATACCCAAAAAAAATAATATACGGAACAAGAATAGCAGCCCCAATGGCAAAA 
GAAATAATAGAATTTATTGAGCACCAACAAAATACAATAGCATATAAAAAAATTAAAATC 

TCCCTAAAGCTGAAACTAATTACAAAAACAAAACATACTTACCAAATTTTATCAACCTTTCTAAAAGAGAAGCAAT. 
AGACATACTAAAATACTATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATA 
TCCCCCAATACAAAATTAGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

t320 .nt 

TTCAATAACAGCCCAGACAACACAATATCTTTAAAGTCAAATGATATTGCCAAAAGAGGAACAATTTATGATAGAA 
ATGGCAAACCAATAGCATTCTCTTCAAAATCCTACTCAATTGGTACAAATCCTCAAAAAATAGAAAATATTGTAAG - 
CACATCTGAAACTCTTGGTGCAATACTTCAAATTAATTCAAGAATTTTAAAGGAAAAGCTTTCCTCTAACAAAGGG 
TTTTTATATATAAAAAGAAAAATAAAAAGAGAAGAATCAGATTTAATAAAAAGAATTCAAGC . . 

-CAAACATCACTTTATATCCTGATTACACAAGAATTTATCCCTTCAGGAATACCACAAGCAATATTACTGGTTTTGT 
AGGAACAGATAATCTTGGCCTTGAGGGCATTGAATTTTCCCTAAATAGCATATTAGGAAAAGATAAAACCAAGCAA 
CAATTTTTAAATGAGGAGCCAGAAACAAACAACATCCACTTAACAATAGACATGGATATACAACAAGGTGTTAGCA 
AAATAGCTAAAAAATACTTTAAAGAAAATAATCCTGAAAGTTTAATTACCTTGGTAAT 

■AATATTATCCATGGTTCAATTTCCTCAATATGATGCAAACTTTTATTCTAAATATCCTGAAGAAATCCGAAAAAAC 
CTTTCTTCATCTCTAACCTATGAGCCCGGAAGCATTAATAAAATTTTTACAGTTGCAATAATATTAGAAAGTGGAA 
AATTAAATTTAGAAGAAAAATTTTTAGAGAATGGAATATATCAAAAACAATTTCCATCAGGAGAAAAAATTACAAT 
CAAAACATTAAATCCCCCCTATAAACATATCGACTCTACAGAGATTTTAATTTATTCATCAAATGTTGGAATAGCT. 
TACATTACTGAAAAAGTTAGCAATGAATACTTTTATAAAAAACTTTTAGATTTTGGCTTTGGGGAAAAAGTTGGAG 
TTCCATTTCCCGGAGAAACAAAAGGACTGCTAAATCATTATTCAAAAT.GGTCAGGACGAAGTAAAGCTACAATTGG ■ 
ATTTGGACAAGAAATAGGAGTGTCAGCGGTTCAAATATTACAAGCTGCAAGCATACTAAGGAATAATGGAATAATG 
CTAJU^CCTAGAATAATAAAAAAAATAAGCAAGGATAAAGGAGAAAATATTAAAGAA 
GAAAAGTAATATCCAAAAATTCAGCACAAAAAGTTTTAAAAATGAT 

AAATCTTAAAATTAAAAATCTTGACATTTCTGCAAAAAGTGGAACATCTCAAGCTATTGATAGAAAAA 
T AC TC AG AAGAAGACTAT AC ATC TTC TAT ATTGGC AAT AT AC C C C AC AGAAC AACC AAAAT AT ATT ATTT AC ATTG 
TAT AC AG AT AC C C AAAAAAAAT AATAT ACGGAAC AAGAAT AGC AGCC C C AATGGC AAAAGAAAT AATAGAATTTAT 
TGAGC ACC AAC AAAAT AC AAT AGC AT AT AAAAAAATTAAAATGCC ATC AAAAATC AAGATC CCTAAAGCTGAAAC T 
AATT AC AAAAAC AAAAC AT ACTTACC AAATTTTATC AAC CTTTC T AAAAG AG AAGC AAT AG AC AT AC T AAAAT AC T 
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ATAAAAATACTATGAAAATAAAAATAAATGGCGATGGATTTGTTTACAAGCAAAGTATATCCCCCAATACAAAATT 
AGAAGATATAACAGAGCTTGAACTGTATTTAAAATAA 

f831.aa 

MAKNNLLVFFIAIIFVFVSIIVVFYNSM 

SSYFISRFYLARAVYFQSQAQYDEA.IKDLDIVIKAKGIESEIAFLNKAAVYEKMGLKEDALLVYEDLINSTSLDFL 
KVRALLSKAILIEEKDKELAVKWEEIVKFPYENNLYINMANNKILELKQN 

t831.aa 

YNSLGKDYVICSGGEIVENLEKDLNDYLKENDAKEREK^ 

AIKDLDIVIKAKGIESEIAFLNKAAVYEKI4GLKEDALLVYEDLINSTSLDFLKVRALLSKAILIEEKDKELAV 
EEIVKFPYENNLYINMANNKILELKQN' 

£831. nt t 

ATGGCTAAAAATAATCTTTTAGTTTTCTTTATTGCTATTATTTTTGTGTTTGTGTCTATTATTGTTGTTTTTTATA 

ATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATTATTT 

AAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGAAAT 

TCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGTATGATGAGGCTA 

TTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAGTTTA 

TGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTTTGGATTTTTTA 

AAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATACGAAG 

AGATTGTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAACTTAAGCAAAA 

TTAA 

t831.nt 

TATAATTCTTTAGGCAAGGATTATGTAAAGAGTGGCGGAGAAATAGTAGAAAATCTTGAAAAAGATTTAAATGATT 
ATTTAAAAGAAAATGATGCCAAAGAGAGAGAAAAAATATTTCTTAGGATAAGGGAGCTTATTTCAAAGGAAAAAGA 
AATTTCATCTTATTTTATTTCAAGGTTCTATTTAGCCAGAGCTGTTTATTTCCAAAGTCAAGCACAGTATGATGAG 
GCTATTAAAGATTTAGATATTGTTATTAAGGCAAAAGGTATTGAAAGTGAAATTGCTTTTCTTAATAAAGCTGCAG 
TTTATGAAAAAATGGGATTAAAAGAAGATGCTTTGTTAGTTTATGAAGATCTTATCAATAGTACTAGTTTGGATTT 
TTTAAAGGTAAGAGCTCTTTTGAGTAAGGCAATATTGATTGAGGAAAAAGATAAAGAGCTTGCTGTGAAAGTATAC 
GAAGAGATTGTTAAGTTTCCGTATGAAAATAATTTATATATAAATATGGCAAATAATAAAATTTTAGAACTTAAGC 
AAAATTAA ' 

f843.aa ' 

MKAiIGNAILLNMPLIFSIGISIGVARMGQGTAALGGLIGYLTFNITENYFIEAFSGLVEAETMSSVGRI 

LNTGIAGSLAVGLLVGYLHNKFYNMKLPKPFVFFSECHFVPIVIILPFCVFLAIFFCLIWSSFDDLIASLGLFVFR 

FEYFGSFLYGFLNRLLLPLGLHSILSFPFEFTSLGGVEIVNGDTVRGLKNIFYAQLLDPSLGKFSSGFAKISSGFY 

LSIMFGLPGAALGVYKGIVHEDKNKVAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFVHAAYSGFALLLANFFN 

VTIGNSFSTGFLDFFMFGILQGNSKTNWISVLPLGAMFFALYYFTF3WLYRYFDFQIFVTDDPFFEGQEGKLESLG* 

IAHLLIQGLGGFDNITKLDVCSTRLHVDWNT^ 

SLFEA.SVMVAVDNVKKGFKTYIEMKEDKKIiEKQGKSGKTYKLSELEED 
t843.aa 

RMGQGTAALGGLIGYLTFNITENYFIE^FSGLVEAETMSSVG 

KLPKPFVFFSECHFVPIVIILPFCVFLAIFFCLIWSSFDDLIASLGLFVFRFEYFGSFLYGFLNRLLLPLGLH3IL 
SFPFEFTSLGGVEIVNGDTVRGLKNIFYAQLLDPSLGKFSSGFAKISSGFYLSIMFGLPGAALGVYKGIVKEDl^K 
VAALLFSGALTAFLTGITEPLEFLFIFTAPLLYFVH^^ 

TNWISVLPLGAMFFALYYFTFSWLYRYFDFQIFVTDDPFFEGQEGKLESLGIAiiLLIQGLGGFDNXTKLDVCSTRL 
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HVDVVNTELVDINTNLLKEAGVLKIGL 
EDKKLEKQGKSGKTYKLSELEED 

f843.nt 

ATGAAGGCTATAGGCAATGCTATTCTTCTCAATATGCCTTTAATTTTTTCTATTGGAATTTCTATTGGAGTTGCAA 
GAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTAT 
TGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAACT 
TTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATGA 
AGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTTT 
TTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAGG 
TTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGC-ATTCTATTTTAT 
CTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATAT 
ATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGATTTTAT 
CTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAGG 
TTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTAT 
TTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAAT 
GTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGA.TACTTCAAGGAAATTCTAAGA 
CAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTGGCTTTATAG 
ATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGGA 
ATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTGATGTTTGTTCTACAAGATTGC 
ATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTGT 
TAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAAG 
AGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAAG 
AAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 

t843.nt 

AGAATGGGGCAGGGAACAGCGGCTTTGGGAGGCCTTATTGGTTATTTAACATTTAATATTACTGAAAATTATTTTA 

TTGAGGCTTTTTCAGGGCTTGTTGAAGCAGAGACAATGTCTTCTGTTGGGCGTATAAATTTTTTTGGTGTTCAAAC 

TTTAAATACGGGAATTGCAGGTTCTTTAGCGGTAGGCCTTTTAGTTGGATATTTGCATAACAAATTTTATAATATG 

AAGCTACCCAAACCTTTTGTGTTTTTTTCAGAGTGCCATTTTGTGCCTATAGTAATAATTTTACCCTTTTGTGTTT 

TTTTGGCTATATTTTTTTGTTTGATTTGGTCAAGTTTTGACGATTTAATTGCATCTTTAGGTTTGTTTGTTTTTAG 

GTTTGAATATTTTGGCAGTTTTCTTTATGGATTTTTAAATAGGCTTTTATTGCCTTTGGGGTTGCATTCTATTTTA 

TCTTTTCCTTTTGAGTTTACTTCTTTGGGAGGAGTGGAGATAGTTAATGGCGATACTGTTAGAGGTCTTAAGAATA. 

TATTTTATGCTCAGCTATTAGACCCATCACTTGGTAAATTTTCATCAGGCTTTGCCAAAATTAGCAGTGGATTTTA 

TCTATCTATTATGTTTGGACTGCCCGGAGCAGCATTAGGGGTTTACAAGGGTATTGTTCATGAAGATAAAAATAAG 

GTTGCAGCACTTCTTTTCTCTGGGGCCTTGACAGCTTTTTTAACAGGAATAACTGAGCCTTTAGAATTTTTATTTA 

TTTTCACAGCGCCTTTGCTTTATTTTGTTCATGCCGCTTATTCGGGGTTTGCATTGTTGCTTGCTAATTTTTTTAA 

TGTTACGATTGGCAATAGCTTTTCTACTGGATTTTTGGATTTTTTTATGTTTGGGATACTTCAAGGAAATTCTAAG 

-ACAAATTGGATTAGTGTATTACCTTTGGGGGCAATGTTTTTTGCTCTTTATTATTTTACTTTTAGTTGGCTTTATA 

GATACTTTGATTTTCAGATATTTGTTACAGACGATCCATTTTTTGAAGGCCAAGAAGGAAAGCTAGAGAGTCTCGG 

AATTGCGCATCTTTTAATTCAAGGTCTTGGTGGATTTGATAATATTACAAAGCTTGATGTTTGTTCTACAAGATTG 

CATGTAGATGTTGTTAATACTGAGCTTGTTGATAATAATTTGCTTAAAGAGGCTGGAGTTCTTAAAATAGGGCTTG 

TTAATGGCAAGGTTCAGCTTTTTTATGGATCTAATGTTTATTATATTAAAAATGCCATTGATACCTATTCTCCAAA 

GAGTCTTTTTGAAGCTAGTGTTATGGTTGCAGTTGATAATGTAAAAAAAGGTTTTAAAACTTATATTGAAATGAAA ' 

GAAGACAAAAAACTTGAAAAGCAAGGTAAATCAGGAAAAACCTATAAGCTTAGCGAATTAGAAGAAGATTAG 

f850.aa 

MRFKKIFLIIFIISNLKVYSYNYAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEA 

HMLTYRGYKDSPKSLISRTDLIEIGF^'YFPILLLINGKNFGEIDLGIGVXNLLFGDWGGHLMQSXIHLILNQKRP 

IPSIKSYDSYNYRGFLSFALNYSYMNFLNLENYMDLSYFADYFIKNSIGITLKI'JENIGFDIKLYSQIQNQIKSLKT 

YSKTQEAETGIGINYQFYSKNFFITNNLNIKNFSTKENFLSVGGFG 

IMIPLKIRNSLFYKINE^NHYFSISTJvmf^^ 

KSISRPIRIKNILQVGIENELGFLFKMLKYRNTEYIFXIYSKVNYIPIAY^ILDEKKLEKHSINFNYLGIGIVVK 
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t850.aa 

YSYNYAIQYKNEGIDKYYFEILNDGFGFSLSDFFDDLRSGSLIFTYVSKYNFIINLEAHMLT.YRGYKDSPKSLISR 
TDLIEIGFMYYFPILLLXNGKNFGEIDLGIGVKNLLFGDWGGHLMQSIIHLILNQHRPIPSIKSYDSYNYRGFLSF 
ALNYSYMNFLNLENYMDLSYFADYFIKNSIGITLKNENIGFDIKLYSQIQNQIKSLKTYSKTQEAETGIGINYQFY 
SraJFFITNNLNIKNFSTKENFLSVGGFGIIITPEEYKKISESNNEFNVISNNFYFGFDIMIPLKIRNSLFYKINEN 
INHYFSISTNYYTNYNETNSFTNQLSSGIMYEFLPQKTFNPYLISGLFFAYNQNNKDIKSISRPIRIKNILQVGIE 
NELGFLFKMLKYRNTEYIFKIYSKVJSTYIPIAYNLDEKKLEKHSINFNYLGIGIVVK 

f850.nt . 

ATGCGGTTTAAAAAAATATTTTTAATAATATTTATAATTTCTAATTTAAAlAGTTTATTCTTATAATTATGCAATCC 
AATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATGATGGATTCGGATTTTCATTAAGCGATTT 
TTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAATTTTATAATAAATTTAGAAGCA 

' CACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGAACAGACTTAATTGAAATAGGCT 
T C ATGT AC T ATTTTC C AATTTT ATTGC T AATT AATGG AAAAAATTTTGG AG AAAT AG AC TTGGGAATTGG AGTT AA 
AAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCTCATTTTAAATCAACACCGTCCA 
ATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGATTTTTAAGCTTTGCTCTAAATTACTCTTACATGA 
ATTTTTTAAATTTAGAAAATTATATGGACTTATCTTATTTTGCAGATTATTTTATTAAAAACAGTATTGGAATTAC 
CTTAAAAAATGAAAATATTGGATTTGATATAAAACTTTATTCCCAAATTCAAAATCAAATCAAAAGCCTCAAAACA 

" TATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAATAAATTATCAJ^TTTTACTCTAAAAATTTTTTCATAACCA 
ATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTCTTAAGCGTTGGGGGATTTGGAATAATCATTACACC 
TGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAATAATTTTTACTTTGGATTTGAT 
ATTATGATCCCATTAAAAATAAGAAATTCATTATTTTATAAAATAAATGAAAACATCAACCATTACTTTTCAATAT 
CAACAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCAATTATCATCAGGCATCATGTATGAATT 
TTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTATAATCAAAACAATAAAGATATC 
AAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAAAATGAATTAGGATTTTTGTTCA 
AAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTAACTATATTCCTATAGCTTATAA 
CTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTAGGAATTGGAATAGTCGTTAAATAA 

£850 .nt 

TATTCTTATAATTATGCAATCCAATATAAAAATGAAGGTATTGACAAATATTATTTTGAAATACTAAATGATGGAT 
TCGGATTTTCATTAAGCGATTTTTTTGATGACTTGAGAAGTGGTTCTCTTATTTTTACCTATGTTTCAAAATACAA 
TTTTATAATAAATTTAGAAGCACACATGTTAACCTATAGGGGTTATAAAGACTCTCCGAAATCTTTAATTAGTAGA 
ACAGACTTAATTGAAATAGGCTTCATGTACTATTTTCCAATTTTATTGCTAATTAATGGAAAAAATTTTGGAGAAA 
TAGACTTGGGAATTGGAGTTAAAAACTTATTATTTGGAGACTGGGGAGGGCATTTAATGCAAAGCATAATTCACCT 
CATTTTAAATCAACACCGTCCAATTCCAAGTATAAAAAGCTACGACAGCTACAATTATAGAGGATTTTTAAGCTTT" 
GC TC T AAATT AC TC TT AC ATG AATTTTTT AAATTT AGAAAATT AT ATGG AC TT ATCTT ATTTTGC AGATT ATTTT A 
TTAAAAACAGTATTGG^TTACCTTAAAAAATGAAAATATTGGATTTGATAT 

TCAAATCAAAAGCCTCAAAACATATTCAAAAACACAAGAAGCAGAAACAGGAATTGGAAT AAATT ATCAATTTT AC 
TCTAAAAATTTTTTCATAACCAATAATTTAAACATTAAAAATTTTTCAACCAAAGAAAATTTC 

GATTTGGAATAATCATTACACCTGAAGAATACAAAAAAATATCAGAATCAAATAATGAATTTAATGTTATAAGTAA 
" TAATTTTTACTTTGGATTTGATATTATGATCCCATTAAAAATAAGAAATTCATT ATTTT AT AAAATAAATGAAAAC 
ATCAACCATTACTTTTCAATATCAACAAATTATTACACTAATTATAATGAAACTAATAGCTTTACAAATCAATTAT 
CATCAGGCATCATGTATGAATTTTTACCACAAAAAACATTCAATCCTTACCTAATTTCGGGATTATTTTTTGCCTA 
TAATCAAAACAATAAAGATATCAAAAGCATCTCAAGACCAATAAGAATAAAAAACATTCTTCAAGTTGGAATTGAA 
AATGAATTAGGATTTTTGTTCAAAATGCTAAAATACCGCAACACTGAGTATATTTTCAAAATATATTCAAAAGTTA 
ACTATATTCCTATAGCTTATAACTTAGATGAAAAAAAATTAGAAAAACATTCTATTAACTTTAATTATTTAGGAAT 
TGGAATAGTCGTTAAATAA 

f353.aa 

MKSFLFWVILGTVGTSSFAQNTPVAIINLYKNEIITKTGFDSPCVT)IFKKTQGRDLTDAEKkQVLQVLIADVLFSQE 
ASKQGIKISDDEVMQTIRTQFGLVNFTDEQIKQMIEKQGTNWGELLSSMKRSLSSQKLVLKQAQPKFSEIKTPSEK 
EIVEYYEANKTKFVNPDISRVSHIFFSTKDKKRSDVLDQAKNILSQIRSKKITFEEAV^KYSbnDESSKAKNGDLGF 
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LSRGDQNAQNLLGADFVKEVFNFNKGDISSPIASKEGFM 
VQQQQIWQVQQDMYGKLNKSANIQILDSSLK 

t853.aa 

QNTPVAI INLYKNEI ITKTGFDSKVDI FKKTQGRIDLTD^^ 

QFGLVNFTDEQIKQMIEKQGTNWGELLSSMKRSL^^ 

RVSHIFFSTKDKKRSDVLDQAKNILSQIRSKKITFEEAVRiC/SN^ 

VFNFNKGDISSPIASKEGFHIVKVTEKYAQRFLGLOT 

KSANIQILDSSLK 

f853.nt * 

ATGAAGAGTTTTTTATTTTGGGTAATATTGGGAACTGTAGGGATTAGCTCTTTTGCTCAAAATACTCC'IGTTGCTA. 
TTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGT 

AGGTAGAGACTTAACTGATGCTGAGAAAAAGCAAGTTCTGCAAGTTTTAATAGCAGATGTTCTTTTTAGTCAAGAG 
GCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAATTAGAACTCAATTTGGGCTCGTGAATT 
TTACTGATGAACAAATCAAGCAAATGATAGAAAAACAAGGTACAAATTGGGGCGAGCTTTTGTCTTCAA' 
ATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAAGTTTTCTGAJ^ 

GAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGATATTTCAAGAGTTAGTCA~ATCTT~T 

TTTCTACTAAAGATAAAAAAAGATCA.GATGTTTTAGATCAAGGAAAAAATATTTTAAGCCAAATAAGA-r 

AATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAGGCTAAAAA-TGGTGA-TCTTGGGTTT 

TTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGGAGCCGATTTTGTGAAAGAGGTT^TTAATTTTAA.TAAGG 

GTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAAGTTAXAGAAAAATATGCTCA.GAGATT 

TTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAGATGCAATAAGAAATAACATGATTAJiT 

GTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTATGGTAAGCTTAACAAGTCTGCAAATATACAAA 

TCTTGGATTCTAGTCTAAAATAA ' 

t853.nt • ■ 

CAAAATACTCCTGTTGCTATTATTAATTTATATAAGAATGAAATTATTACTAAAACTGGTTTTGATTCTAAGGTTG 
AT AT ATTTAAAAAGACC C AAGGT AGAGAC TT AAC TG ATGCTGAGAAAAAGC AAGTTC TGC AAGTTTT AAT AGC AGA 
TGTTCTTTTTAGTCAAGAGGCTTCAAAGCAAGGAATTAAAATCTCAGATGATGAGGTTATGCAAACAATTAGAACT 
CAATTTGGGCTTGTGAATTTTACTGATGAACAAATCAAGCAAATGATAGAAAAAXAAGGTACAAATTGGGGCGA^ 
TTTTGTCTTCAATGAAAAGATCTCTGTCTTCTCAAAAGCTTGTTTTAAAGCAAGCTCAGCCTAAGTTTTCTGAJ^T 
TAAAACTCCTAGTGAGAAAGAAATTGTTGAGTATTATGAGGCTAATAAAACTAAGTTTGTAAATCCCGATATTTCA 
AGAGTTAGTCATATCTTTTTTTCTACTAJUVGATAAAAAAAGATCAGATGTTTTAG 

GCCAAATAAGATCAAAAAAAATTACTTTTGAAGAAGCTGTAAGAAAATATTCAAATGACGAATCTTCTAAGGCTA^ 

AAATGGTGATCTTGGGTTTTTATCAAGAGGTGATCAAAATGCTCAAAATCTTCTTGdAGCCGATTTTGTGAAAGAG 

GTTTTTAATTTTAAT^GGGTGATATATCTTCGCCTATTGCTTCAAAGGAAGGGTTTCATATTGTTAAJ^GTTACAG 

AAAAATATGCTCAGAGATTTTTAGGTTTGAATGATAAAGTGTCTCCTACTGCAGATTTGATTGTCAAAGATGCAAT 

AAGAAATAACATGATTAATGTTCAACAACAGCAAATTGTTGTTCAAGTACAGCAAGATATGTATGG 

AAGTCTGCAAATATACAAATCTTGGATTCTAGTCTAAAATAA 

f859.aa ' , 

MKLPKLYKLILLFLFTTRLFSVKDEKSDNKLELFSOTETK^ 

IYIQKSKKINYPNRNLGNNINQKTANDVNFTC 

I I I KMLKTKG YT L I E Y I EDNN 

t859 .aa • * 

VKIDEKSDNKLELFSNVETKIKKNSKNYDSN^ 
QKTA 



fS59.nt 
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ATGAAATTACCAAAACTTTACAAATTAATACTACTCTTTCTTTTTACAACAAGATTGTTTTCAGTAAAAGATGAAA 

AATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAAAAATTCTAAAAATTACGAC 

TTCAAACAGCAAAAAGATCAAAAAAGAATCAATTTTAAAAAGA 

ATATACATACAAAAATCAAAAAAAATTAATTACCCCAACAGAAATTTAGGCAATAATATCAATCAAAAAACTGCAA 
ATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACTTTCAAGAAATTAA 
AAATGCTAATAAATTTCCAGCTAAAACCGAAAAAACTCACATGCTAATCGGCCCAATATTAAAAGATAATCTAGGA 
ATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAATTAA 

t859.nt 

GTAAAAGATGAAAAATCAGACAATAAATTGGAATTATTTTCAAACGTAGAAACAAAAATCAAAAA 

ATTACGACTCAAATTCAAACAGCAAAAAGATGAAAAAAGAATCAATTTTAAAAAGAGATACAAACAGC^ 

TAT AAATTCC AAT AT AT AC AT AC AAAAATC 1 AAAAAAAATT AATT AC CC C AACAGAAATTTAGGC AATAAT ATC AAT 

CAAAAAACTGCAAATGATGTAAATTTTACAAAAACTAGTTATGTTAAAGTTTATCCCAACTATAAAGACGATAACT 

TTCAAGAAATTAAAAATGCTAATAAATTTCCAGCTAAAAGCGAA 

AGATAATCTAGGAATAATAATTAAAATGCTAAAAACAAAGGGATACACTTTAATAGAATACATAGAGGACAATAAT 
TAA 

f861.aa' ' 

MKNFKEVIIIFDSGIGGLSYFKYIKSRIGGCQYVYVADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVLAC 

NTISVSVYNKLNFVFPVVYTLPDVSSVSDLVLKRVLLI^ 

YKKYALRCLEALKFEVWTGREIWLGCTHYLHLKVM^ 

FVDDEFYLTENKNLTFYQNFCKKYNLRFKGMIV 

t861.aa 

RIGGCQYVWADNKNFPYGEKSPEYLLEAVLFLIEKLKKIYNIGALVLACNTISVSVYNKLNFVFPVV^ 
VSDLVLKRVLLIATNTTLESKFVKDQVNIH^LIVKAAGELVNFVEYGENYKKYALRCLEALKFEVWTGREIVF^ 
GCTHYLHLKVMI EDFLKI PVYENREL WKNL I RSMNFS EHKGNYYKNDFDF VDDEF YLTENKNLTF YQNFCKKYNL 
RFKGMIV ' • • 

f861.nt \ 

ATGAAAAATTTCAAAGAAGTAATAATTATTTTTGATTCAGGAATAGGAGGGCTTTCTTATTTTAAATATATTAAAA 
GTAGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTCCCTTATGGAGAAAAAAGTCCTGAATA 
TCTTCTAGAAGCAGTTTTGTTTTTGATTGAGAAGCTTAAAAAAATCTATAATATTGGTGCATTAGTTTTGGCTTGT 
AATACAATTTCTGTTAGTGTATACAATAAATTAAATTTTGTTTTTCCAGTAGTCTATACTTTGCCAGATGTAAGTT. 
CAGTTTCAGATCTTGTTTTAAAAAGAGTTCTTTTGATTGCAACAAATACTACTCTTGAAAGCAAATTTGTTAAGGA 
TCAAGTAAATATACATAATGATTTGATTGTAAAAGCTGCTGGAGAGCTTGTTAATTTTGTTGAATATGGAGAGAAT 
TACAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTGGTAGAGAAATTGTTTTTC 
TTGGATGCACGCATTATTTGCATCTTAAGGTAATGATAGAAGATTTTTTAAAAATTCCTGTTTATGAGAATCGTGA 
ATTAGTGGTAAAAAATCTT ATT AGATCAATGAATTTTTCTGAACACAAAGGT AATT ATTATAAGAATGATTTTGAT 
TTTGTAGATGATGA.GTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAATATAATC 
TTCGC TTTAAGGGAATGATAGTTTGA 

. t861 .nt 

AGAATAGGGGGATGCCAATATGTTTATGTTGCCGATAATAAAAATTTCCCTTATGGAGAAAAAAG.TCCTGAATATC 
TTCTAGAAGCAGTTTTGTTTTTGATTGAGAAGCTTAAAAAAATCTATAATATTGGTGCATTAGTTTTGGCTTGTAA 
TACAATTTCTGTTAGTGTATACAATAAATTAAATTTTGTTTTTCCAGTAGTCTATACTTTGCCAGATGTAAGTTCA 
GTTTCAGATCTTGTTTTAAAAAGAGTTCTTTTGATTGCAACAAATA.CTACTCTTGAAAGCAAATTTGTTAAGGATC 
AAGTAAATATACATAATGATTTGATTGTAAAAGCTGCTGGAGAGCTTGTTAATTTTGTTGAATATGGAGAGAATTA 
CAAAAAATATGCTCTTAGATGTTTAGAAGCTTTAAAATTTGAAGTTGTAAATACTGGTAGAGAAATTGTTTTTCTT 
GGATGCACGCATTATTTGCATCTTAAGGTAATGATAGAAGATTTTTTAAAAATTCCTGTTTATGAGAATCGTGAAT 
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TAGTGGTAAAAAATCTTATTAGATCAATGAATTTTTCTGAACACAAAGGTAATTATTATAAGAATGATTTTGATTT 

TGTAGATGATGAGTTTTATTTGACCGAAAATAAAAATTTGACTTTTTATCAAAATTTTTGCAAAAAATATAATCTT 
CGCTTTAAGGGAATGATAGTTTGA 

f363.aa *' . 

MIRLKVLILCLFGIFVLNGFADTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEEIVQNFSD 
LANIAKAGIRYGTYAQFGAKFDDFVSIGFELLFNINLLKAIKRSDGTANENFSFIMAXTPRF-YTKLDFFVLALAFF 
TGPKINIATSSADSVLAELGTMGWDIGARLSFSFLILEGYYVWN-IKNPKFSDFKFGIGFEFGIV 

t3 63.aa 

DTNFEFNFGGGVAFPVSPFSSFYNEALEINAKLKQNLPSDLSPIEKEEIVQNFSDLANIAKAGIRYGTYAQFGAKF 
DDFVSIGFELLFNINLLKAIKRSDGTANENFSFIMAITPRFYTKLDFFVLALAFFTGPKINIATSSADSVLAELGT 
MGWDIGARLSFSFLILEGYYVWNIKNPKFSDFKFGIGFEFGIV ' 

f363.nt 

ATGATTAGGCTTAAAGTTTTAATTTTGTGTTTATTTGGGATTTTTGTGTTAAATGGTTTTGCAGATACTAATTTTG 
■ AATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGGCTTTAGAGATTAA 
TGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCAAAATTTTTCCGAT 
TTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTTGATGATTTTGTTT 
CTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAACTGCAAATGAAAA 
TTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGCTTTAGCGTTTTTC 
ACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACAATGGGCTGGGATA 
TTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAAACCCTAAATTTTC 
TGATTTCAAGTTTGGAATAGGTTTTGAATTTG 
GAATTGTGTAG 

t363 .nt* " ** * 

GATACTAATTTTGAATTCAATTTTGGTGGTGGGGTTGCTTTTCCTGTTAGTCCCTTTTCAAGCTTTTACAATGAGG 
CTTTAGAGATTAATGCAAAGCTTAAGCAAAATTTGCCTTCAGATTTATCCCCAATAGAAAAAGAAGAGATAGTCCA 
AAATTTTTCCGATTTAGCCAATATTGCTAAAGCTGGAATAAGATATGGAACTTACGCTCAATTTGGCGCTAAATTT 
GATGATTTTGTTTCTATTGGATTTGAGCTTTTGTTTAACATTAATCTTCTTAAAGCAATAAAGCGTTCGGATGGAA 
CTGCAAATGAAAATTTCTCGTTTATTATGGCAATAACACCAAGATTTTATACAAAATTAGATTTTTTTGTTTTAGC 
TTTAGCGTTTTTCACAGGTCCTAAGATCAATATAGCGACTTCTTCTGCGGATTCTGTTTTAGCAGAACTGGGAACA 
ATGGGCTGGGATATTGGTGCTAGACTTTCATTTTCTTTTTTAATTCTTGAAGGGTACTATGTTTGGAATATTAAAA 
ACCCTAAATTTTCTGATTTCAAGTTTGGAATAGGTTTTGAATTTGGAATTGTGTAG 

f3 68.aa 

MIDLTQEKQEILIKNKFLAiCVFGLM 

ISSNTATALFLLYSALTGVTLSSIFMIYTQGSIVFTFGITAGTFLGMSVYGYTTTTDLTKMGSYLIMGLWGIIIAS 

LVI^FFRSSGLNFLISILGVVIFTGLTAYDVQNISKMDKMLQDDTEIKNRjyLAVV 
RRND 

t368.aa 

TSENQTIKAIIFSNSMSFMAMILIQFGLVYAISGALNKISSNTATALFLLYSALTGVTLSSIFMIYTQGSIVFTFG 
ITAGTFLGMSVYGYTTTTDLTKMGSYLIMGLWGIIIASLVNMFFRSSGLNFLISILGVVIFTGLTAYDVQNISKMD 
KMLQDDTEIKOT^WASLKLYLDFINLFLYLLRFLGQRRND 

f368.nt 
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ATGATCGATTTAACACAAGAAAAACAAGAAATACTAATAAAAAACAAGTTTTTAGCCAAAGTTTTCGGGCTTATGT 
CAATTGGACTTTTAJVTCTCAGCAGTATTTGCATATGCAACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTC 
AAATTCAATGTCATTTATGGCTATGATACTTATACAATTTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAA 
ATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTACTCAGCACTAACAGGAGTAACATTATCTTCTATATTTA 
TGATTTACACACAAGGATCAATAGTATTCACATTCGGAAT-TACTGCTGGAACATTTCTTGGAATGTCTGTTTATGG 
ATACACTACAACAACAGATCTAACAAAAATGGGAAGCTATTTAATAATGGGCTTATGGGGAATCATTATTGCATCT 
CTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTTCCTTATATCTATTTTGGGCGTAGTTATATTTACAGGCT 
TAACAGCTTATGATGTTCAAAATATTTCTAAAATGGACAAAATGCTACAAGACGACACTGAAATAAAAAACAGAAT 
GGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTTTATAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCAA 
AGAAGAAACGATTAA 

t3 68 .nt . - " _ 

ACCTCAGAAAATCAAACAATCAAAGCAATAATATTCTCAAATTCAATGTCATTTATGGCTATGATACTTATACAAT 
TTGGACTTGTATATGCAATAAGTGGTGCTCTTAATAAAATATCAAGCAATACTGCAACAGCTCTTTTCTTGCTCTA 
CTCAGCACTAACAGGAGTAACATTATCTTCTATATTTATGATTTACACACAAGGATCAATAGTATTCACATTCGGA 
ATTACTGCTGGAACATTTCTTGGAATGTCTGTTTATGGATACACTACAACAACAGATCTAACAAAAATGGGAAGCT 
A.TTTAATAATGGGCTTATGGGGAATCATTATTGCATCTCTTGTTAATATGTTTTTTAGAAGCTCAGGTCTTAATTT 
CCTTATATCTATTTTGGGCGTAGTTATATTTACAGGCTTAACAGCTTATGATGTTCAAAATA.TTTCTAAAATGGAC 
AAAATGCTACAAGACGACACTGAAATAAAAAACAGAATGGCGGTTGTAGCCTCACTTAAACTTTATTTAGATTTTA 
" TAAATTTATTCTTATATCTTCTAAGATTTTTGGGCCAAAGAAGAAACGA'TTAA 

f371.aa 

MKFFFLLQIALILLSNSSLLFGQSPPKEKEDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIAIGEYK 
RAEKEAIIGLGIKKHDIRIIQALGEAYFFQKNYDNALK^ 

YEHALRFSPNNQNLLIKLARSRINAKNKILAEEALIKILTISPNNLEAKNLLEELKKSNNKP 
t371.aa . * 

EDSLLLYKEGKFKEAILNTLEEIRLNPSNLDARTILIWSLIAIGEYKRAEKEAIIGLGIKKHDIRI IQALGEAYFF 
QKNYDNALKYFQEYISLDSKGARIIKVYNLIAD^ 

LAEEALIKILTISPNNLEAKNLLEELKKSNNKP - ■ 

f 371 .nt 

ATGAAATTTTTTTTTCTATTACAAATAGCTTTAATTCTACTATCCAATTCAAGCTTGTTATTTGGACAATCACCGC 
- CTAAAG AAAAAGAAG ACTC TC TTC TTC TAT ATAAAG AAGGAAAATTTAAAG AAGC T ATTTT AAAC ACGTT AG AAGA 
AATTC G AC T AAATC C T AGTAAC TT AG ATGCTAGG AC AATATTGAT ATGG AGC TTAAT AGC CAT AGGAGAAT AC AAG 
AGAGCTGAAAAAGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATTATTCAAGCACTAGGAGAAG 
CTTATTTCTTTCAAAAAAATTATGACAATGCATTAAAATACTTTCAAGAATACATTAGCCTTGATTCTAAAGGAGC 
AAGAATAATAAAAGTTTATAATTTAATTGCAGATTCTTTTTATGAGCTAAAAAGATATAATGAAGCCGATTTTGCA 
TACGAACATGCATTACGTTTTTCTCCTAATAACCAAAATCTATTAATAAAATTAGCAAGATCAAGAATAAATGCAA 
AAAATAAAATATTAGCAGAAGAAGCACTAATTAAAJ^TTCTTACAATCTCTCCTAATAATCTAGAGGCAAAAAATTT 
A.CTAGAAGAATTAAAAAAAAGCAACAACAAACCTTGA 

t371.nt 

GAAGACTCTCTTCTTCTATATAAAGAAGGAAAA.TTTAAAGAAGCTAT-TTTAAACACGTT 

ATC C T AGTAAC TTAGATGC TAGG AC AATATTGAT ATGGAGCTTAAT AGC CAT AGGAGAAT AC AAGAGAGCTGAAAA 

AGAGGCGATTATAGGACTTGGCATTAAAAAACATGACATAAGAATTATTCAAGCACTAGGAGAAGCTTATTTCTTT 

CAAAAAAATTATGACAATGCATTAAAATACTTTCAAGAATACATTAGCCTTGATTCTAAAGGAGCAAGAATAATAA 

AAGTTTATAATTTAATTGCAGATTCTTTTTATGAGCTAAAAAGA.TATAATGAAGCCGATTTTGCATACGAACATGC 

ATTACGTTTTTCTC C T AAT AAC C AAAATCTATTAAT AAAATT AGC AAG ATC AAGAATAAATGC AAAAAAT AAAJV 

TTAGCAGAAGAAGCACTAATTAAAATTCTTACAATCTCTCCTAATAATCTAGAGGCAAAAAATTTACTAGAAGAAT 

TAAAAAAAAGCAACAACAAACCTTGA 
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f502.aa 

MKKANFLSTNFLILLLVCFVNVNLF 

LNESEIEDEAIYLGLTYNVKLNDFFYFKSELARSISILFFKNSNKKYKNTKSTFLSNFNIGVIKNTIYEDILRLKN 
VNTIFLADNSQELVLALKNDKVDYIYGD^ 

FSYLMKMPEELVFSFLDSNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDLWNLLSRQHIFKPIFKGFSKEDIKK 
SLDGKSVGIFGC^IISNDSVLENVNYWSKPIYP^ 

IVNNSYGFIENSITTKYLLKLNGYNGRLKSYDSIFNKNRFLVLAIDNRIYKVIKYILNSIFDDISFESLLQXDKNW 

LDKEEINSSRINSYKIMNKVKFNIEEKIWLSKNNKLNLAVKNWY 

KVHSSLDLKKLIKSGKIDMI^TNATO 

. KSKLILVSSFNEALLLLYKGKVDGIISDEYTAAAVFEELNIDDVEKIPTFRDLAFDLSLAIYNQDYILKEIIQKW 
' MRSNVDSQMYLNDWKFDIYYKSRSIRFKNFKFLVITFIIFYFTFL 
NAAKTIFIAWSHDIRTPINGIMAATELLDT^ 

IDLESEMEMVLKAFQSQCAKKNIDLFSYSKSIFNNYIKGDIVKIKQVLINLIGNAFKFTDDGVIVLNYEEVCRTRT 
DGNRVLVTVEFKVIDTGKGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTT 
FSFMLPFLLGSELKSKKLSiNRFQSVNGDNKVLISA/LLSQKSIKIFEHCSILLGCSSNVR 
YNFWINVNNDNIQEGIRLANNIERLNSDVQI^ 

EDLVPIDSALRIKEPINVLIAEDNQVNQKVLKDILVVIGINENFIDVVDDGVKALKSLKDKKYTISF 

GFSVAKEIRKFEKAKNLKPCVLVAVTAHALQEYKDKCLASGMNDYISKPIHISSIKTILKKYLQFEVDDIGENENL 

NQLVKFPNLDVNRALKELNLSWSYSELCRGLV^ 

FQKIETSKDSISELKKMYSFVKDDLFQLISDIKENILFESEIVSENKLYFKNNDQFLNLLNKLLIGIKTR 
EILESINKYVLDDNIQVLFSDLRRNLRLYRFAESSKILEEIIEMLNNKRY 

t502 .aa ■ r 

CFVNVNLFSKDIFKFKLVTDQFFPFYYKNNKGEYEG^^ 

IWKLNDFFYFKSELARSISILFFKNSNK^^ 

KNDKVDYIYGDCKTLHYIAW^ 

SNAKGSFVDVGLYNDYPPLSFINSQGKLSGILVDLWNLLSRQHIFKPIFKGFSKEDIKKSLDGKSVGIFGGIISND 
SVLENVNYWSKPIYPLNFKFYSKDLSNDAGPIN^ 

LLKLNGYNGRLKSYDSIFNKNRFLVLAIDNRIYKVIKYILNSIFDDISFESLLQIDKNWLDKEEINSSRINSYKIM- 
NKVKFNIEEKIWLSKNNKLNLAVKNWYPIDYVEAIsnsTYKGINQFLLDKIRMF 

DMLNTNATDSNLDNVFNIKLNSRIPLYIFSNKKRVLPSRSLEKFAILDFLYSKNLASNIKSKLILVSSFNEALLLL 
YKGKVDGIISDEYTAAAVFEELNIDDVEKIPTF^ 

IYYKSRSIRFKNFKFLVITFIIFYFTFLGFVIIFMFRL'SFEQKRRYSFViyENEKKIAEAANAAKTIFIANVSHDIR^ 

PINGIMAATELLDTTILTDVQKDYVRMINYSSDSLLSLIDDIL 

CAKKNIDLFSYSKSIFNNYIKGDIVKIKQVLINL^ 

KGIEKENFSKIFEIFKQEDDSSSRVHEGAGLGLSISRELIRLMGGLGIAVDSKVGEGTTFSFMLPFLLGSELKSKK 
LSINRFQSVNGDNKVLNVLLSQKSIKIFEHC^ 

RLANNIERLNSDVQIIFLFYYLDNKALKNLKYGWKKPLMGLGICSILYKKEFNPEMDFEDLVPIDSALRIKEPIN ' 
VLIAEDNQVNQKVLKDILVVIGINENFIDVVDDGVKALKSLKDKKYTISFIDIRMPRYDGFSVAKEIRKFEK^ . 
KPCVLVAVTAHALQEYKDKCLASGMNDYISKPIHISSIKTI^ 

LNLSYVSYSELCRGLVDFISINIIDLEKAFDEEDLSLIKDISH3ISGALSNMRSELYKDFQKIETSKDSISELKKM 
YSFVKDDLFQL I SDI KENI LF ES EI VS ENKL YFKNNDQFLNLLNKLL IG I KT.RKPRS YKE I LES INKYVLDDNIQV 
L F SDLRRNLRL YRF AE S SKILEEII EMLNNKR Y ' 

f502.nt 

ATGAAAAAAGCAAACTTTTTAAGTACTAATTTTTTAATTTTACTTTTGGTTTGCTTTGTCAACGTCAATTTATTTT 

CTAAGGATATTTTCAAGTTTAAGCTTGTAGATCAATTTTTTCCTTTTTACTACAAGAATAATAAAGGAGAATATGA 

AGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGATAATAATGCTGATATTATGGTTGAGCATATTGATAAT 

TTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACTTATAATGTAAJ\ATTAAATGATTTTTTTT 

ATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTATTTTTTAAAAACTCTAATAJ^ 

TTCAACATTTTTATCCAATTTTAATATAGGAGTTATTAAAAATACAATATATGAAGAT^ 

GTT AAC AC C ATTTTTTTGGCTGATAATTC TC AAG AGTTAGTATTGGC C TTAAAAAACGAT AAAGTTG ATTAT AT AT 
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ATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAGATCTTGTGATTTTTACCGGGGATGT 

TTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGATAGTAAAGAATTTGAATTTAGATTTG 

TTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGATAGCAATGCTAAGGGAAGTTTTGTTG 

ATGTTGGTTTATATAATGATTATCCTCCTTTAAGTTTTATTAATTCACAGGGAAAATTGTCTGGCATTTTAGTGGA 

TTTGTGGAATCTTCTCTCAAGACAACATATCTTTAAACCTATTTTTAAGGGATTTTCCAAAGAGGATATTAAGAAA 

TCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTAGCAATGATAGTGTGTTGGAAAATGTTAATTATG 

TAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTCTAAAGACCTAAGCAATGATGCTGGTCCAATAAA 

TTCTCAGTTTATTGATTTTAATTTTAATAATATTCAATTAAATAAGAATAAAGATATTGTTAATAACTTTATAGAT 

ATTGTTAATAATTCATATGGGTTTATAGAAAATTCAATAACAACAAAATATTTGTTAAAATTAAATGGATATAACG 

GTAGATTAAAATCTTACGATTCGATTTTTAATAAAAATAGGTTTTTAGTATTAGCCATTGATAATAGGATTTATAA 

GGTTATTAAATATATTCTCAATTCTATATTTGATGATATTTCATTTGAATGTTTGCTTCAAATAGATAAAAATTGG 

TTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATGAATAAGGTTAAATTTAATATAGAAG 

AAAAAATTTGGTTATCAAAAAATAATAAATTAAATCTTGCTGTTAAAAATTGGTATCCAATAGATTATGTTGAGGC 

AAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTAGAATGTTTTCAGGTTTGAGATTTAACATAATT 

AAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATCGATATGCTAAATACTAATGCAACCG 

ATTCAAATTTAGATAATGTTTTCAACATAAAATTAAATTCTCGAATTCCACTTTATATT^TTTCAAATAAGAAAAG 

GGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACTTGATTTTTTATATAGTAAAAATTTGGCTTCTAATATT 

AAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAGCGTTGCTTCTTCTTTATAAGGGAAAGGTAGATGGGATTA 

TTAGCGATGAGTATACAGCTGCTGCTGTTTTTGAGGAATTAAATATTGATGATGTTGAAAAAATTCCTACTTTTAG 

AGATTTGGCTTTTGATTTGAGTCTTGCTATTTATAATCAAGATTATATCTTGAAAGAAATTATTCAAAAAGTTGTT 

ATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGATTGGAAATTTGATATTTATTATAAATCCAGAAGTATCA 

GGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATTTTATTTTACTTTTTTAGGATTTGTAATTATATT 

TATGTTCAGATTATCATTTGAGCAGAAAAGAAGATATTCTTTTGTGATGAATGAAAAAAAGATTGCGGAAGCCGCT 

AATGCTGCTAAAACCATTTTTATAGCCAATGTCAGTCATGATATTCGTAGCCCTATTAACGGAATAATGGCGGCTA 

CTGAGCTTTTGGATACAACTATTCTTACAGATGTTCAAAAAGATTATGTTAGGATGATAAATTATTCATCTGATTC 

TTTGCTTTCTTTAATT.GATGATATATTGTATTTGTCTAAAATAGATGTCAATGAATTATATGTTGAGAGTCAAGAG 

ATTGATTTAGAGAGTGAAATGGAAATGGTTTTAAAAGCTTTTCAATCTCAATGTGCAAAGAAAAATATTGATTTAT 

TCTCTTATTCTAAATCTATTTTTAATAATTATATAAAGGGTGATATTGTAAAAATTAAACAAGTTTTAATTAATTT 

AATAGGAAATGCTTTTAAGTTTACAGATGATGGTGTTATTGTTTTAAATTATGAAGAAGTATGTAGAACAAGAAGT 

GATGGTAATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGATACAGGCAAAGGGATTGAAAAAGAAAATTTTT 

CTAAGATATTTGAAATATTTAAACAAGAGGATGATTCTTCTTCAAGGGTTCATGAAGGTGCAGGATTGGGATTGTC 

AATATCTAGAGAGCTTATAAGACTAATGGGTGGTCTTGGTATTGCTGTTGATAGCAAGGTGGGAGAGGGTACAACT 

TTTTCATTTATGTTGCCCTTTTTATTGGGTAGTGAGCTTAAAAGTAAAAAATTGTCAATCAATAGATTTCAATCAG 

TAAATGGTGACAATAAAGTATTAAATGTGCTTTTAAGTCAAAAATCTATTAAAATTTTTGAGCACTGTTCGATTTT 

ATTGGGATGCTCTTCTAATGTGCGCTATGTAGCGTCTTTTGAGGATGCTTATAAAGTCTTCAAGAAATACCCTTCT 

TATAATTTTGTTTATATAAATGTAAATAACGATAATATTCAAGAGGGTATTCGACTTGCCAATAATATTGAAAGAC 

TAAATTCTGATGTACAAATTATTTTTTTATTTTATTATTTAGATAATAAAGCTCTAAAAAATTTAAAATATGGTTA 

TGTTAAAAAGCCTTTAATGGGGCTTGGTATATGCTCTATTGTTTATAAAAAAGAGTTTAACCCAGAAATGGATTTT 

GAGGATTTGGTTCCAATAGATAGTGCTTTAAGGATAAAAGAGCCCATTAATGTTTTAATAGCTGAAGATAATCAGG 

TAAATCAAAAAGTGTTGAAAGATATTCTTGTTGTTATAGGCATTAATGAAAATTTTATTGATGTTGTAGATGATGG 

AGTAAAGGCTTTAAAATCTTTAAAAGATAAAAAATATACTATCTCTTTTATTGATATACGAATGCCAAGATATGAT 

GGATTTTCGGTGGCTAAGGAAATTAGAAAATTTGAAAAGGCAAAGAATTTAAAGCCTTGTGTTTTGGTTGCTGTAA 

CAGCGCATGCTTTGCAAGAGTATAAAGACAAGTGTCTTGCAAGTGGTATGAATGATTATATCTCAAAACCAATACA 

CATAAGTTCAATTAAAACTATATTAAAAAAATACTTACAGTTTGAAGTTGATGATATTGGGGAGAATGAAAATTTG 

AATCAACTTGTTAAGTTTCCTAATTTAGATGTTAATAGGGCTTTAAAAGAATTAAATCTTTCATATGTATCATATT 

CTGAATTATGTAGAGGGCTTGTTGATTTTATCTCTATTAATATTATTGATTTGGAAAAAGCTTTTGATGAGGAAGA 

TTTGTCTTTAATTAAGGATATATCTCATTCAATATCTGGAGCTCTTTCTAATATGCGTAGCGAATTGTATAAAGAT 

TTTCAAAAAATTGAAACAAGTAAAGATTCAATTTCTGAGTTGAAAAAAATGTATTCTTTTGTAAAAGATGATTTAT 

TTCAACTAATAAGCGACATAAAGGAAAATATTTTGTTTGAGTCTGAGATTGTTAGTGAGAACAAGCTATATTTTAA 

AAATAATGATCAATTTTTAAACCTTCTCAACAAACTTTTAATTGGTATTAAGACTAGAAAGCCAAGAGAATACAAA 

GAAATTCTTGAGAGCATTAATAAATATGTTTTAGACGATAATATTCAGGTATTATTTAGTGATCTTCGCAGAAATT 

T AAGATT ATATAGATTTGCTGAGAGC TC TAAGATTC TTGAAGAGATT ATTGAAATGC TTAATAAT AAGAGATATT A 

G • 



t502 .nt 
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TGCTTTGTCAACGTCAATTTATTTTCTAAGGATATTTTCAAGTTTAAGCTTGTAGATCAATTTTTTCCTTTTTACT 
ACAAGAATAATAAAGGAGAATATGAAGGACTTATTTTTTCTATTTTAGATAAATGGGCAAAAGATAATAATGCTGA 
TATTATGGTTGAGCATATTGATAATTTAAATGAAAGTGAAATTGAAGACGAAGCAATATATTTAGGATTAACTTAT 
AATGTAAAATTAAATGATTTTTTTTATTTTAAAAGTGAGCTTGCTAGGAGTATTTCAATTTTATTTTTTAAAAACT 
CTAATAAAAAATATAAJ^lAATACCCATTCAACATTTTTATCCAATTTTAATATAGGAGTTATTAAAAATACAATATA 
TGAAGATATCTTAAGGTTAAAAAACGTTAACACCATTTTTTTGGCTGATAATTCTCAAGAGTTAGTATTGGCCTTA 
AAAAACGATAAAGTTGATTATATATATGGTGATTGCAAGACTTTACATTATATTGCAAATAACTTTTTAAGTGAAG 
ATCTTGTGATTTTTACCGGGGATGTTTTTTATAGTATCAAAAATAGAGTGGCTATTAGTAGAAATGCTCCTGAGAT 
AGTAAAGAATTTGAATTTAGATTTGTTTTCATATTTAATGAAAATGCCTGAGGAACTTGTTTTTTCTTTTTTAGAT 
AGCAATGCTAAGGGAAGTTTTGTTGATGTTGGTTTATATAATGATTATCCTCCTTTAAGTTTTATTAATTCACAGG 
GAAAATTGTCTGGCATTTTAGTGGATTTGTGGAATCTTCTCTCAAGACAACATATCTTTAAACCTATTTTTAAGGG 
. ATTTTCCAAAGAGGAT ATTAAGAAATCATTAGATGGAAAATCAGTAGGTATTTTTGGAGGAATTATTAGCAATGAT 
AGTGTGTTGGAAAATGTTAATTATGTAGTAAGTAAGCCAATATATCCTCTTAATTTTAAATTTTATTCTAAAGACC 
' TAAGCAATGATGCTGGTCCAATAAATTCTCAGTTTATTGATTTTAATTTTAATAATATTCAATTAAATAAGAATAA 
AGATATTGTTAATAACTTTATAGATATTGTTAATAATTCATATGGGTTTATAGAAAATTCAATAACAACAAAATAT 
TTGTTAAAATTAAATGGATATAACGGTAGATTAAAATCTTACGATTCGATTTTTAATAAAAATAGGTTTTTAGTAT 
TAGCCATTGATAATAGGATTTA.TAAGGTTATTAAATATATTCTCAATTCTATATTTGATGATATTTCATTTGAATC 
TTTGCTTCAAATAGATAAAAATTGGTTGGATAAAGAAGAGATTAATAGTTCTAGAATAAATAGTTATAAAATTATG 
AATAAGGTTAAATTTAATATAGAAGAAAAAATTTGGTTATCAAAAAATAATAAATTAAATCTTGCTGTT^ 
GGTATCCAATAGATTATGTTGAGGCAAATAATTATAAAGGAATAAATCAATTTTTGCTTGATAAGATTAGAATGTT 
TTCAGGTTTGAGATTTAACATAATTAAAGTACACAGCAGTTTAGATCTTAAAAAATTAATCAAATCTGGAAAAATC 
GATATGCTAAATACTAATGCAACCGATTCAAATTTAGATAATGTTTTCAACATAAAATTAAATTCTCGAATTCCAC 
TTTATATTTTTTCAAATAAGAAAAGGGTGCTTCCATCTAGATCTTTAGAAAAGTTTGCTATACTTGATTTTTTATA 
TAGTAAAAATTTGGCTTCTAATATTAAATCAAAGCTTATTCTGGTAAGCAGTTTTAATGAAGCGTTGCTTCTTCTT 
TATAAGGGAAAGGTAGATGGGATTATTAGCGATGAGTATACAGCTGCTGCTGTTTTTGAGGAATTAAATATTGATG 
ATGTTGAAAAAATTCCTACTTTTAGAGATTTGGCTTTTGATTTGAGTCTTGCTATTTATAATCAAGATTATATCTT 
GAAAGAAATTATTCAAAAAGTTGTTATGCGTTCAAATGTTGACAGTCAGATGTATTTAAATGATTGGAAATTTGAT 
ATTTATTATAAATCCAGAAGTATCAGGTTTAAAAATTTCAAATTTTTAGTGATAACATTCATTATATTTTATTTTA 
CTTTTTTAGGATTTGTAATTATATTTATGTTCAGATTATCATTTGAGCAGAAAAGAAGATATTCTTTTGTGATGAA 
TGAAAAAAAGATTGCGGAAGCGGCTAATGCTGCTAAAACCATTTTTATAGCCAATGTCAGTCATGATATTCGTACC 
CCTATTAACGGAATAATGGCGGCTACTGAGCTTTTGGATACAACTATTCTTACAGATGTTCAAAAAGATTATGTTA 
GGATGATAAATTATTCATCTGATTCTTTGCTTTCTTTAATTGATGATATATTGTATTTGTCTAAAATAGATGTCAA 
TGAATTATATGTTGAGAGTCAAGAGATTGATTTAGAGAGTGAAATGGAAAT.GGTTTTAAAAGCTTTTCAATCTCAA 
TGTGCAAAGAAAAATATTGATTTATTCTCTTATTCTAAATCTATTTTTAATAATTATATAAAGGGTGATATTGTAA 
AAATTAAACAAGTTTTAATTAATTT AATAGGAAATGCTTTTAAGTTTACAGATGATGGTGTTATTGTTTTAAATTA 
TGAAGAAGTATGTAGAACAAGAACTGATGGTAATAGGGTTTTGGTTACAGTTGAATTTAAGGTAATAGATACAGGC 
AAAGGGATTGAAAAAGAAAATTTTTCTAAGATATTTGAAATATTTAAACAAGAGGATGATTCTTCTTCAAGGGTTC 
ATGAAGGTGCAGGATTGGGATTGTCAATATCTAGAGAGCTTATAAGACTAATGGGTGGTCTTGGTATTGCTGTTGA 
TAGCAAGGTGGGAGAGGGTACAACTTTTTCATTTATGTTGCCCTTTTTATTGGGTAGTGAGCTTAAAAGTAAAAAA 
TTGTCAATCAATAGATTTCAATCAGTAAATGGTGACAATAAAGTATTAAATGTGCTTTTAAGTCAAAAATCTATTA 
AAATTTTTGAGCACTGTTCGATTTTATTGGGATGCTCTTCTAATGTGCGCTATGTAGCGTCTTTTGAGGATGCTTA 
TAAAGTCTTCAAGAAATACCCTTCTTATAATTTTGTTTATATAAATGTAAATAACGATAATATTCAAGAGGGTATT 
CGACTTGCCAATAATATTGAAAGACTAAATTCTGATGTACAAATTATTTTTTTATTTTATTATTTAGATAATAAAG 
C TCT AAAAAATTT AAAAT ATGGTT ATGTT AAAAAGC C TTT AATGGGGC TTGGT AT ATGCTC T ATTC TTT AT AAAAA 
AGAGTTTAACCCAGAAATGGATTTTGAGGATTTGGTTCCAATAGATAGTGCTTTAAGGATAAAAGAGCCCATTAAT 
GTTTTAATAGCTGAAGATAATCAGGTAAATCAAAAAGTGTTGAAAGATATTCTTGTTGTTATAGGCATTAATGAAA 
ATTTTATTGATGTTGTAGATGATGGAGTAAAGGCTTTAAAATCTTTAAAAGATAAAAAATATACTATCTCTTTTAT 
TGATATACGAATGCCAAGATATGATGGATTTTCGGTGGCTAAGGAAATTAGAAAATTTGAAAAGGCAAAGAATTTA 
AAGCCTTGTGTTTTGGTTGCTGTAACAGCGCATGCTTTGCAAGAGTATAAAGACAAGTGTCTTGCAAGTGGTATGA 
ATGATTATATCTCAAAACCAATACACATAAGTTCAATTAAAACTATATTAAAAAAATACTTACAGTTTGAAGTTGA 
TGATATTGGGGAGAATGAAAATTTGAATCAACTTGTTAAGTTTCCTAATTTAGATGTTAATAGGGCTTTAAAAGAA 
TTAAATCTTTCATATGTATCATATTCTGAATTATGTAGAGGGCTTGTTGATTTTATCTCTATTAATATTATTGATT 
TGGAAAAAGCTTTTGATGAGGAAGATTTGTCTTTAATTAAGGATATATCTCATTCAATATCTGGAGCTCTTTCTAA 
TATGCGTAGCGAATTGTATAAAGATTTTCAAAAAATTGAAACAJ^GTAAAGATTCAATTTCTGAGTTGAAAAAAATG 
TATTCTTTTGTAAAAGATGA.TTTATTTCAACTAATAAGCGAC AT AAAGG AAAAT ATTTTGTTTGAGTCTGAGATTG 
TTAGTGAGAACAAGCTATATTTTAAAAA.TAATGATCAATTTTTAAACCTTCTCAACAAACTTTTAATTGGTATTAA 
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GACTAGAAAGCCAAGAGAATACAAAGAAATTCTTGAGAGCATTAATAAATATGTTTTAGACGATAATATTCAGGTA 

TTATTTAGTGATCTTCGCAGAAATTTAAGATTATATAGATTTGCTGAGAGCTCTAAGATTCTTGAAGAGATTATTG 
AAATGCTTAATAATAAGAGATATTAG 

f527.aa 

I^LLVICIAKFILILFLFTSCNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVNIYIANKNYEDALEIVNNGIIDD^ 

SREYYPLYLYLMGNIYDSMGEDFVAFNIYKRWDNFDDYVYEN^ 

DNIJtfNEEKGNYFYNLALSLEDVQDYDESYFY^ 

DVKNFVLSGNTSKLLNIRDKNNFFIQ 

SRGWDHIHEWYFVFKRIVYPKDPEINNGWTWIGVYLGKK 
t527.m 

CNQKQSEIQNLTHLLKSSNKNRLDKFLIIDRVVNIYIANKNYEDALEIVNNGIIDDESREYYPLYLYLMGNIYDSM 
GEDFVAFNIYKR\A/DNFDDYWENHSMKTRVAKKIWLNIDSIDKINYYKFILNMGIDNLNNEEKGNYFYNLALSL 
EDVQDYDESYFYYKKFLSIPRAHLKIDSRDYFNVVTKINYFN^ 

• KNNFF IQSWDQKGGKSNS INTNS FLTTMIRLGGRRKNGIQFAKHLEADS SDDI S YLESRGWDHIHEWYFVFKRIVY 
PKDPEINNGWTWIGVYLGKK 

f 527 .nt 

ATGAATCTATTGGTCAAAATTGCTAAATTTATTTTGATTTTGTTTTTATTTACTTCTTGCAACCAAAAGCAAAGCG 
' AGATTCAAAATCTTAGACATCTTTTAAAATCTTCTAATAAAAATAGATTAGATAAATTTCTTATTATTGATAGAGT 
TGTTAACATATATATTGCAAATAAiU\ATTATGAAGATGCTTTAGAAATTGTAAATAATGGAATTATTGATGATGAA 
TCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAATATTTATGATTCCATGGGAGAAGATTTTGTAGCTT 
TTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATTATGTTTATGAAAACCATTCAATGAAAACAAGGGTTGC 
TAAAAAGATTGTCAATTTAAATATTGATTCAATCGATAAAATCAATTATTACAAATTTATATTAAATATGGGGATT 
GATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGCGCTAAGTTTGGAAGATGTTCAAGATTACG 
ATGAATCTTATTTTTATTATAAAAAATTTCTTTCAATTCGAAGGGCACATTTAAAAATAGATTCTAGAGACTATTT 
TAATGTTGTTACAAAAATTAATTACTTTAATAATCCAGAGTTTGTTGTTTATAGAAATTTAGGAGATTTAATCCAG 
. GATGTTAAiy^TTTTGTTCTTTCTGGTAATACTTCTAAATTGCTTAATATAAGAGATAAGAATAATTTTTTTATTC* 
AAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAATAGCTTTTTAACCACTATGATTAGGCTTGG 
GGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTTGAGGCAGATTCTAGTGACGATATATCTTATCTTGAG 
TCAAGGGGCTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGAATTGTTTATCCTAAAGATCCAGAAATTA 
ATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

t527 .nt • - 

TGCAACCAAAAGCAAAGCGAGATTCAAAATCTTACACATCTTTTAAAATCTTCTAATAAAAATAGATTAGATAAAT 
TTCTTATTATTGATAGAGTTGTTAACATATATATTGCAAATAAAAATTATGAAGATGCTTTAGAAATTGTAAATAA 
TGGAATTATTGATGATGAATCTAGAGAATATTATCCTTTGTATCTTTATTTAATGGGCAATATTTATGATTCCATG 
GGAGAAGATTTTGTAGCTTTTAATATTTACAAGCGTGTTGTTGATAATTTTGATGATTATGTTTATGAAAACCATT 
CAATGAAAACAAGGGTTGCTAAAAAGATTGTCAATTTAAATATTG 

TATATTAAATATGGGGATTGATAATTTAAATAATGAGGAAAAGGGTAATTATTTTTATAATCTTGCGCTAAGTTTG 

GAAGATGTTCAAGATTACGATGAATCT'TATTTTTATTATAAAAAATTTCTTTCAATTCCAAGGGCACATTTAAAAA 

TAGATTCTAGAGACTATTTTAATGTTGTTACAAAAATTAATTACTT.TAATAATCCAGAGTTTGTTGTTTATAGAAA 

TTTAGGAGA'TTTAATCCAGGATGTTAAAAATTTTGTTCTTTCTGGTAATACTTCTAAATTGCTTAATATAAGAGAT 

AAGAATAATTTTTTTATTCAAAGCTGGGATCAAAAGGGTGGAAAGAGTAATTCCATTAATACTAATA 

CCACTATGATTAGGCTTGGGGGGAGAAGAAAAAACGGAATACAATTTGCAAAGCATCTTGAGGCAGATTCTAGTGA 

CGATATATCTTATCTTGAGTCAAGGGGGTGGGACCATATTCATGAATGGTATTTTGTTTTTAAAAGAATTGTTTAT 

CCTAAAGATCCAGAAATTAATAATGGCTGGACTTGGATAGGCGTGTATTTAGGTAAAAAATAA 

f541.aa 

MNKILLLILLESIVFLSCSGKGSLGSEIPKVSLIIDGTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSD 
LEGLKDAGSDLIWLIGYRFSDVAKVAALQNPDMKYAIIDPIY3NDPIPANLVGMTFRAQEGAFLTGYIAAKLSKTG 
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KIGFLGGIEGEIVDAFRYGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEV 
AKELGSGHYIIGVTDEDQAYLAPDNVITSTTKDVG 

ELEKEIDNLSSKIINKEIIVPSNKESYEKFLKEFI 
"t541.aa 

CSGKGSLGSEIPKVSLIIDGTFDDKSFNESALNGVKKVKEEFKIELVLKESSSNSYLSDLEGLKDAGSDLIWLIGY 
RFSDVAKVAALQNPDMKYAIIDPIYSJTOPIPANLVGMTFRAQEGAFLTGYIAAKLSKTGKIGFLGGIEGEIVDAFR 
YGYEAGAKYANKDIKISTQYIGSFADLEAGRSVATRMYSDEIDIIHHAAGLGGIGAIEVAKELGSGHYIIGVDEDQ 
AYLAPDNVITSTTKDVGRALNIFTSNHLKTNTFEGGKLINYGLKEGWGFVRNPKMISFELEKEIDNLSSKIINKE 
IIVPSNKESYEKFLKE 

FI \" ' 

. f541.nt 

ATGAATAAAATATTGTTGTTGATTTTGCTTGAGAGTATTGTTTTTTTATCTTGTAGTGGTAAAGGTAGTCTTGGGA. 

GCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAATCTTTTAATGAGAGTGCTTTAAATGG 

CGTAAAAAAAGTTAAAGAAGAATTTAAAATTGAGCTTGTT^ 

CTTGAAGGGCTTAAGGATGCGGGCTCAGATTTAATTTGGCTTATTGGGTATAGATTTAGCGATGTGGCCAAGGTTG 

CGGCTCTTCAAAATCCCGATATGAAATATGCAATTATTGATCCTATTTATTCTAACGATCCTATTCCTGGAAATTT 

GGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATATTGCTGCAAAACTTTCTAAAACAGGT 

AAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGTAGATGCTTTTAGGTATGGGTATGAAGCTGGTGCTAAGT 

ATGCTAATAAAGATATAAAGATATCTACTCAGTATATTGGTAGTTTTGCTGAQCTTGAAGCTGGTAGAAGCGTTGC 

AACTAGGATGTATTCTGATGAGATAGACATTATTCATCATGCTGCAGGCCTTGGAGGAATTGGGGCTATTGAGGTT 

GCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTGA.TGAAGATCAAGCATATCTTGCTCCTGACAATGTAA 

TAACATCTACAACTAAAGATGTTGGTAGAGCTTTAAATATTTTTACATCTAACCATTTAAAAACTAATACTTTCGA 

AGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGGAGTTGTGGGGTTTGTAAGAAATCCTAAAATGATTTCCTTT 

GAACTTGAAAAAGAAATTGA.CAATCTTTCTAGCAAAATAATCAACAAAGAAATTATTGTTCCATCTAA 

GTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

t541.nt 

TGTAGTGGTAAAGGTAGTCTTGGGAGCGAAATTCCTAAGGTATCTTTAATAATTGATGGAACTTTTGATGATAAAT 
CTTTTAATGAGAGTGCTTTAAATGGCGTAAAAAAAGTT 

CTCATCAAATTCTTATTTATCTGATCTTGAAGGGCTTAAGGATGCGGGCTCAGATTTAATTTGGCTTATTGGGTAT 

AGATTTAGCGATGTGGCCAAGGTTGCGGCTCTTCAAAATCCCGATATGAAATATGCAATTATTGATCCTATTTATT 

CTAACGATCCTATTCCTGCAAATTTGGTGGGCATGACCTTTAGAGCTCAAGAGGGTGCATTTTTAACGGGTTATAT 

TGCTGCAAAACTTTCTAAAACAGGTAAAATTGGATTTTTAGGGGGAATAGAAGGCGAGATAGTAGATGCTTTTAGG 

TATGGGTATGAAGCTGGTGCTAAGTATGCTAATAAAGATATAAAGATATCTACTCAGTATATTGGTAGTTTTGCTG 

ACCTTGAAGCTGGTAGAAGCGTTGCAACTAGGATGTATTCTGATGAGATAGACATTATTCATCATGCTGCAGGCCT 

TGGAGGAATTGGGGCTATTGAGGTTGCAAAAGAACTTGGTTCTGGGCATTACATTATTGGAGTTGATGAAGATCAA. 

GCATATCTTGCTCCTGACAATGTAATAACATCTAGAACTAAAGATGTTGGTAGAGCTTTAAATATTTTTACATCTA 

ACCATTTAAAAACTAATAGTTTCGAAGGTGGCAAATTAATAAATTATGGCCTTAAAGAAGGAGTTGTGGGGTTTGT 

AAGAAATCCTAAAATGATTTCCTTTGAACTTGAAAAAGAAATTGACA 

ATTATTGTTCC ATCTAATAAAGAAAGTTATGAGAAGTTTCTTAAAGAATTTATTTAA 

£561. aa 

MYIWGFFKNYLSLFLIFLVIACTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKN^ 
KVFYFSEEDGHFQTI PSKENAKLI VYFYDNVYAGEAPI 3 1 SGKEAF IFVGITPDFKKI INSNLHGAKSDLIGTFKD 
LNIKNSKLEITVDENNSDAKTFLESVNYIIDGVEKISPMLTN 
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CTSKDSSNEWEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNIKVFYFSEEDGHFQTIPSKENA 
KLIVYFYDNWAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKLEITVDEI^NSDAKT 
FLESVNYI IDGVEKI S PMLTN 

f561.nt ' , 

ATGTATAAAAATGGTTTTTTTAAAAACTATTTGTCATTGTTTTTAATTTTTTTAGTAATTGCTTGTACTTCAAAAG 
• ATAGCTCAAATGAATATGTTGAGGAGCAAGAAGCGGAGAACTCTTCTAAGCCTGATGATTCTAAAATAGATGAACA 
TACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTTGGGGAAAAATATA 
AAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCAAAGTTAATAGTTT 
ATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTATTTTTGTTGGGAT 
TACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGGTACTTTTAAAGAT 
CTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATAATTCAGATGCCAAGACCTTCCTTGAATCTG 
TTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

' .t5 61 .nt 

TGTACTTCAAAAGATAGCTCAAATGAATATGTTGAGGAGCAAGAAGCGGAGAACTCTTCTAAGCCTGATGATTCTA 
AAATAGATGAACATACTATTGGGCACGTTTTTCACGCTATGGGAGTAGTTCATTCAAAAAAGGATCGAAAAAGTTT 
GGGGAAAAATATAAAGGTTTTTTATTTTTCTGAAGAAGATGGACATTTTCAAACAATACCCTCAAAAGAGAATGCA 
AAGTTAATAGTTTATTTTTATGACAATGTTTATGCAGGAGAGGCTCCAATTAGTATCTCTGGAAAAGAAGCCTTTA 
TTTTTGTTGGGATTACCCCTGACTTTAAAAAGATTATAAATAGCAATTTACATGGCGCTAAAAGTGATCTTATTGG 
TACTTTTAAAGATCTTAATATTAAAAATTCAAAATTGGAAATTACAGTTGATGAGAATAATTCAGATGCCAAGACC 
TTCCTTGAATCTGTTAATTACATTATCGACGGCGTTGAAAAAATTTCACCTATGTTAACGAATTAA 

f604.aa . .... 

MSFNKTKKIGKKIKIVTLLMLAVSLIACNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLNT 
KTGKLKPGLAKNWEASKDKKTYQFYLRDNLFWSDGVEITAEGIRKSFLRILNKETGSTNVDMLKSIIKNGQEYFDG 
KVSDSELGIKAIDSKTLEITLTAPKPYFLEL 

FEKNERYYNAKEVELDELVYITSDNDLiVYN^ " 
KPLDDARVREALTLAIDRETLTYKVLNIXnVPTO 

LKYNTNETHKKIAAFIQNQWKKILNINLMLTNENWPVLTNSRNTGNFEIIRVGRIGEYLDPHTYFTIFTRENSQLA 
SYGYSNLEFDKLIRESDLEKDPIKRKQLLRi6^ESIIIEKDFPAAPIYIYSGHYLFRNDKWTGWNPNVSEVYYLSEL 
KPIKNAKHN " " ' 

t604 .aa 

CNNNSEKEKLAFKVYIGGAPSSLDPHLVDETIGARILEQIFSGLLTLNTKTGKLKPGLAKIWEASKDKKTYQFYLR 
DNLFWSDGVEITAEGIRKSFLRILNKETGSTNVDMLKSIIKNGQEYFDGKVSDSELGIKAIDSKTLEITLTAPKPY 
FLELLLHYAFMPVPIHVIEKYKGNWTSPENMV^ 

TVTNMYKNNEIDAIFNSIPPDIVNEIKLQKDYYQHKSNAIYLYSFNTKIKPL 
DGTVPTRE IT PDLKNYNYGKKI^LFD.PEKSK 

LMLTNENWPVLTNSRNTGNFE I IRVGRIGEYLDPHTYFTIFTRENSQLAS YGYSNLEFDKL I RESDLEKDP I KRKQ 
LLRKAESIIIEKDFPAAPIYIYSGHYLFRNDKWTGWNPNVSEVYYLSELKPIKNAKHN 

f604.nt ' s 

. ATGAGCTTTAATAAAACTAAAAAAATCGGTAAAAAAATTAAAATAGTAACACTACTTATGCTTGCTGTGTCTTTAA 
TTGCATGCAATAATAATTCA.GAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTCATCGCTTGA 
CCCTCATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAACAAATATTCTCAGGGCTTTTGACATTAAATACC 
AAAACAGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATTTTATC 
TAAGGGACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGAT^GAAAATCTTTTTTAAGAATTTT 
AAATAAAGAJ^CAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATTTTGACGGG 
AAAGTATCCGATTCTGAACTTGGAATCAJVGGCAATTGATAGTAAAACGCTGGAAATAACACTTACGGCCCCAAAGC 
CATATTTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGTGATTGAAAAATATAAGGGAAA 
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TTGGACAAGCCCTGAAAACATGGTTACTAGCGGTCCTTTTAAATTAAAAAAAAGATTACCTAATGAAAAAATTATC 
TTTGAAAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATG- 
ATCTTA.CTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGACATTGTAAA 
TGAAATAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAATA 
AAACCCCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTTTAACTTACAAAGTGC 
TAAATGATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTT 
ATTTGATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATGCTAACA 
CTAAAATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATO 

TCAATCTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAATAAG 
AGTTGGACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCA 
TCATACGGATATTCAAACCTAGAATTTGACAAACTCATCAGAGAATCA.GATCTTGAAAAAGATCCTATAAAAAGAA 
AACAATTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGG 
GCATTATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTA 
AAACCAATTAAAAATGCAAAACATAATTAA 

t604.nt j 

TGCAATAATAATTCAGAAAAAGAAAAATTAGCATTTAAAGTATACATAGGGGGAGCGCCCTCATCGCTTGACCCTC 
ATTTGGTAGATGAGACAATAGGAGCAAGAATTTTAGAAGAAATATTCTCAGGGCTTTTGACATTAAATACCAAAAC 
AGGAAAGCTAAAGCCCGGACTTGCTAAAAATTGGGAAGCCTCAAAAGATAAAAAAACATATCAATTTTATCTAAGG 
GACAACCTTTTTTGGAGCGATGGAGTTGAAATTACCGCTGAAGGGATAAGAAAATCTTTTTTAAGAATTTTAAATA 
• AAGAAACAGGATCTACAAATGTTGACATGCTCAAATCAATAATAAAAAATGGACAAGAGTATTTTGACGGGAAAGT 
ATCCGATTCTGAACTTGGAATCAAGGCAATTGATAGTAAAACGCTGGAAATAACACTTACGGCGCCAAAGCCATAT 
TTTCTTGAACTGCTTCTACATTACGCATTCATGCCAGTACCTATTCATGTGATTGAAAAATATAAGGGAAATTGGA 
C AAGC C C TG AAAAC ATGGTT AC T AGC GGTCC TTTT AAATT AAAAAAAAG ATT AC C T AATGAAAAAATT ATC TTTG A 
AAAAAACGAACGTTATTATAATGCAAAAGAAGTAGAACTTGATGAGCTTGTCTACATTACGTCTGACAATGATCTT 
ACTGTGTACAATATGTACAAAAACAACGAAATTGATGCTATTTTTAACAGCATCCCGCCGGACATTGTAAATGAAA 
TAAAACTACAAAAAGACTATTACCAACACAAAAGTAATGCAATTTATTTATATTCATTTAATACAAAAATAAAACC 
CCTTGATGATGCTAGAGTTAGAGAAGCTTTAACCTTAGCTATTGACAGAGAAACTTTAACTTACAAAGTGCTAAAT 
GATGGCACAGTTCCTACAAGAGAAATAACTCCTGATCTTAAAAATTACAATTACGGTAAAAAATTGGCTTTATTTG 
ATCCTGAAAAATCTAAAAAGCTTTTGGCAGATGCAGGGTATCCTAATGGGAAAGGATTCCCAATGCTAACACTAAA 
ATATAATACAAACGAAACTCATAAAAAAATTGCTGCATTTATTCAAAACCAATGGAAAAAAATTCTAAATATCAAT 
CTTATGCTTACCAACGAAAATTGGCCTGTTCTTACCAACAGCAGAAATACTGGCAATTTTGAAATAATAAGAGTTG 
GACGCATTGGGGAATATTTAGATCCACACACATACTTTACTATATTCACAAGAGAAAATTCACAACTTGCATCATA 
CGGATATTCAAACCTAGAATTTGACAAACTC ATC AGAGAATCAGATCTTGAAAAAG ATC CT AT AAAAAG AAAAC AA 
TTACTCAGAAAAGCAGAATCAATAATAATTGAAAAAGATTTTCCTGCTGCACCAATATACATATATTCTGGGCATT 
ATCTTTTTAGAAACGATAAATGGACTGGATGGAATCCTAATGTATCAGAGGTTTATTATCTTTCTGAATTAAAACC 
AATTAAAAATGCAAAACATAATTAA 

f736.aa 

' mkkviilifmlstsllyncknqdnekivsiggsttvspildemilrynkinnntkvtydaqgssvginglfnkiyk. 
iaissrdltkeeieqgaketvfaydalifitspeikitniteenLak^ 

syssikdlllnkifktheeaqfrqdgiwksngevxektsltphsigyiglgyaknsiekglnilsvnstyptket 
insnkytikrnliivtnnkyedksvtqfidfmtsstgqdiveeqgflgikt ■ 

. t736 .aa - 
CKNQDNEKIVSIGGSTTVSPILDEMILRYNKINN^^ 

ETVFA YDAL I F ITS PE IKITNITE ENLAKI LNGE IQNWKQVGG PDAKINF INRDS S SGS YSS IKDLLLNKI r KTHE 
EAQFRQDGI WKSNGE VI EKTSLT PHS IGY IGLGYAKNS I EKGLNI LSVNSTYPTKETINSNKYTIKRNL I IVTNN 
KYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT 



f736 .nt 
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ATGAAAAAAGTTATTATCTTAATTTTTATGCTATCAACAAGTTTATTATACAACTGTAAAAATCAAGACAATGAAA 

AAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGATTTTAAGATATAATAAAATAAA 

CAATAATACTAJ^AGTAACATACGATGCACAAGGAAGTAGTGTTGGCATAAACGGGCTATTTAACAAAATATATAAA 

ATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGGGCAAAAGAAACTGTATTTGCTTATGATG 

CTTTAATTTTCATTACAAGCCCTGAAATAAAAATTAGAAATATTACAGAAGAAAATCTAGCTAAAATACTAAATGG 

AGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTATCAATCGAGACTCTTCTTCTGGT 

TCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCACGAAGAAGCTCAATTTAGACAAGACG 

GAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAAAACAAGCCTTACTCCCCACTCAATAGGATATATAGGTCT 

TGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAGCACATATCCTACA^AGAAACA 

ATA^TAGCAATAAATACACCATTAAAAGAAATTTAATAATAGTTACAAATAACAAATACGAGGATAAAAGCGT^ 

CTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAACAAGGCTTTTTAGGGATAAAAAC 
' ATAA 

t736.nt 

TGTAAAAATCAAGACAATGAAAAAATTGTATCAATTGGAGGATCTACAACTGTAAGCCCAATACTAGACGAAATGA 

TTTTAAGATATAATAAAATAAJ^CAATAJVTACTAAAGTAACATACGATGCACAAGGAAGTAGTGTTGG 

GCTATTTAACAAAATATATAAAATAGCAATATCATCAAGAGATTTAACAAAAGAAGAAATTGAACAAGGGGCAAAA 

GAAACTGTATTTGCTTATGATGCTTTAATTTTCATTACAAGCCCTGAAATAAAAATTACAAATATTACAGAAGAAA 

ATCTAGCTAAAATACTAAATGGAGAAATTCAAAATTGGAAACAAGTGGGAGGTCCTGATGCTAAAATCAACTTTAT 

CAATCGAGACTCTTCTTCTGGTTCTTATTCGTCTATAAAAGACCTACTTCTTAATAAAATATTCAAAACTCACGAA 

GAAGCTCAATTTAGACAAGACGGAATAGTGGTAAAATCTAATGGAGAGGTAATTGAAAAAACAAGGCTTACTCCCC 

A.CTCAATAGGATATATAGGTCTTGGATACGCAAAAAATTCAATAGAAAAGGGTTTGAATATTCTTTCTGTTAACAG 

CACATATCCTACAAAAGAAACAATAAATAGCAATAAATACACCATTAAAAG 

AAATACGAGGATAAAAGCGTAACTCAATTTATTGATTTCATGACAAGCTCAACTGGACAAGATATTGTTGAAGAAC 
■ AAGGC TTTTT AGGG AT AAAAAC AT AA 

• f752 .aa 

MNKKLNEVLLKLDQDLIKCVKGSLDLEISGVTYSSKLVLPRFVFFALPGIHFDGHDFIEIAIQKGSNVWCSRDVD 
FYSPNVTYIKVDDFNIRKFMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGVKVGFISTVFFDDGS 
GSLIKNPYRQSTPESTEIHSFLSTMVKNEAQYAILESTSHGLDLETARLIDVNYFAWFTNIGHEHLEFHGTIQ^ 
LWKLGLFRSVSDDAGFGVINLDDLYSSDFKJNIAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGVKYLANV 
SLLGSFNVENVMAALILVSQILNIDIQDIVIDKLNCIKSLDGRMDSINLGQNFSVIIDYAHTPGAFSKLFPI 
TNRLISVFGSAGERDVEKRFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIVNKVENKDLFFIADRKQAIE 
' KAI S LAKAGDLWALGKGHES S 1 1 YKNREVFWNEQEWKNAI LSLEKS EKEK 

t752.aa - 

CVKGSLDLEISGVTYSSKLVLPRFVFFALPGIHFDGHDFIEIAIQKGS^rWVCSRDVT)FYSPNVTYIKVDDFNIRK 

FMSNFSNIFYDEPSKKLKVIGVTGTDGKSSVCYYIYLLFKKKGYKVGFISTVFFDDGSGSLIKNPYRQSTPESTEI 

HSFLSTMVKNEAQYAILESTSHGLDLETARLIDVNYFAWFTNIGHEHLEFHGTIQNYLNVKLGLFRSVSDDAGFG 

VINLDDLYSSDFKNAVKKSFTYSLKSSKADFFVSFIDEKTDSTRFEFYHKGV^ 

S Q I LN I D I Q D I VDKLNC I KS LDGRJXTO^ 

RFLQGQIADIYSDLIILCDEDPRGENSMCIIKDIAKGIVNKVENKDLFFIADRKQAIEKAISUAKAGDLW 
HESS I I YKNREVFWNEQEVVKNAI LSLEKS EKEK 

f752 .nt ; * • ~ " 

ATGAATAAAAAACTTAATGAAGTTTTATTAAAGTTAGATCAAGATTTAATAAAATGTGTAAAAGGTTCTCTTGATT 
TAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGTTTTTTGCTCTTCCAGGAATTCA 
TTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTGTTGTGTGTTCACGAGATGTGGAT 
TTTTACAGTCCTAATGTTACTTA^TATTAAGGTAGATGACTTTAACATAAGAJVAATTTATGTCTAATTTTTCAAATA 
TTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACTGGCACTGACGGGAAAAGTTCTGTTTGTTA 
TTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATCGACAGTATTTTTTGATGATGGGAGT 
GGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATACATTCATTTTTAAGCACCATGG 
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TTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACCTTGAAACAGCAAGGCTTATTGA 
TGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATTTCATGGCACAATTCAAAATTAT 
TTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGGGTTATTAATCTTGATGACCTTT 
ATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAAGCAGTAAAGCGGATTTTTTTGT 
TAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGGGGTTAAATATCTTGCTAATGTT 
AGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTTTCTCAAATTTTAAATATCGATA 
TTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGGATAGTATTAATTTGGGGCAAAA 
TTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTTTCCTATTTTTAAAAGATTTGCT 
ACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAAAGATTTTTGCAAGGGCAAATCG 
CAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGAATAGTATGTGTATAATTAAAGA 
CATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTTTTTATTGCTGATAGAAAGCAGGCTATTGAA 
AAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGTCATGAAAGTTCAATAATTTATA 
AAAATAGAGAAGTTTTTTGGAATGAACAAGAGGTAGTTAAAAATGCTATTTTAAGTTTAGAAAAATCAGAAAAGGA 
GAAGTGA - 

t752.nt " . 

TGTGTAAAAGGTTCTCTTGATTTAGAAATATCAGGAGTTACTTATAGTTCTAAATTGGTTTTGCCCAGGTTTGTGT 
TTTTTGCTCTTCCAGGAATTCATTTTGATGGGCATGATTTTATTGAAATTGCAATTCAAAAGGGTAGTAATGTTGT 
TGTGTGTTCACGAGATGTGGATTTTTACAGTCCTAATGTTACTTATATTAAGGTAGATGACTTTAACATAAGAAAA 
TTTATGTCTAATTTTTCAAATATTTTTTATGATGAGCCTTCAAAAAAATTAAAAGTTATTGGAGTCACTGGCACTG 
ACGGGAAAAGTTCTGTTTGTTATTATATATATCTTCTTTTTAAAAAAAAGGGTGTTAAAGTAGGTTTTATATGGAC 
AGTATTTTTTGATGATGGGAGTGGAAGCTTGATTAAAAATCCTTACAGACAATCAACTCCCGAGTCTACGGAAATA 
CATTCATTTTTAAGCACCATGGTTAAAAATGAAGCTCAATATGCAATTCTTGAATCTACTTCTCATGGGCTTGACC 
TTGAAACAGCAAGGCTTATTGATGTTAATTATTTTGCAGTTGTTTTTACCAATATTGGACATGAGCATCTTGAATT 
TCATGGCACAATTCAAAATTATTTGAATGTCAAGCTGGGTCTTTTTCGGTCTGTTAGTGATGATGCTGGTTTTGGG 
GTTATTAATCTTGATGACCTTTATTCTTCTGATTTTAAGAATGCTGTTAAGAAATCTTTTACTTATAGCTTAAAAA' 
GCAGTAAAGCGGATTTTTTTGTTAGTTTTATTGATGAGAAAACCGATTCTACTAGATTTGAATTTTATCACAAGGG 
GGTTAAATATCTTGCTAATGTTAGCCTACTGGGGAGTTTTAATGTTGAGAATGTAATGGCTGCTCTTATTTTAGTT 
TCTCAAATTTTAAATATCGATATTCAAGATATTGTTGATAAACTTAACTGCATTAAAAGTCTTGATGGGCGTATGG 
ATAGTATTAATTTGGGGCAAAATTTTTCTGTAATAATTGATTATGCTCATACTCCTGGTGCTTTTTCCAAGCTTTT 
TCCTATTTTTAAAAGATTTGCTACCAATAGATTGATTTCTGTTTTTGGCTCTGCAGGAGAAAGAGATGTTGAAAAA 
AGATTTTTGCAAGGGCAAATCGCAGATATTTATTCTGATTTAATAATACTTTGCGATGAAGATCCAAGAGGCGAGA 
ATAGTATGTGTATAATTAAAGACATTGCAAAAGGAATTGTAAATAAAGTTGAAAATAAGGATTTATTT.TTTATTGC 
TGATAGAAAGCAGGCTATTGAAAAAGCAATAAGTCTTGCAAAAGCAGGAGATTTGGTTGTTGCTTTGGGCAAAGGT 
CATGAAAGTTCAATAATTTATAAAAATAGAGAAGTTTTTTGGAATGAACAAGAG'GTAGTTAAAAATGCTATTTTAA 
GTTTAGAAAAATCAGAAAAGGAGAAGTGA 

f798.aa 

MVFRTYKHLELIMLPMLMLSCAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTI 
LGEDGKEI PEFKNKFGYS YI I S PVKMDGKYS YYASLLILFETTKNGDDEYEIEDVKFVTAGSTLELKNSLLAVENS 
QEEGYVTAYPFGILMSDEIK^AFKLTYKNGHWNYMLADLTVKNKLTQETKIYKISLNSKLIIEFLKEVLKENSILK 
DIAGDLFEDI 

t798.aa 

CAFFKKPQSVHQDSNTGK? I SDEKLHL I SGKISNKKLP I INSNHDVTWIKTKAMTI LGEDGKEI PEFKNKFGYS YI 
ISPVKMDGKYSYYASLLILFETTKNGDDEYEIEDVKFV^^ 

NAFKLTY^JGHWNYMIjADLTVKNKLTQETKIYKISLNSKIjIIEFLKEVLKENSILKDIAGDLFEDI 
f798.nt 

ATGGTATTTAGAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGA 
AACCACAATCTGTACATCAA.GACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACATTTAATATCAGGCAA 
AATTTCAAATAAAAAATTGCCAATCATAAATA^ 
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TTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATAATATCTCCTGTAAAAA 

TGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGA 

AATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCA 

CAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAA 

CATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAAT 

TTATAAAATTTCTCTTAATTCAAAATTAATTATTGAAT 

GACATAGCTGGAGATTTATTTGAAGATATATAA 

t798 .nt 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAA 

AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 

ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 

GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 

AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 

AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTA 

CTCAAGAAA.CTAJU^TTTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAG 

AAATTCTATATTAAAAGAC AT AGCTGGAGATTTATTTGAAGATATAT AA 

f805.aa 

MLRKLKDISKIVLVTDGLTPNCQTCGKLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAV 
QASSYNPTRILNIDKKGLICHGYDANLNVLDKDFNLKLTMIESKIIFNNL 

t805.aa 

CQTCGKLIANGDEVYIAEDGLFHSVKSNTIAGSTLTMIQGLKNLIEFGFSLSDAVQASSYNPTRILNIDKKGLICH 
GYDANLNVLDKDFNLKLTMI ESKI I FNNL 

f805.nt 

ATGCTTAGAAAGCTTAAAGATATAAGTAAAATAGTCCTTGTAACTGACGGACTTACTCCGAATTGTCAAACTTGTG 
GAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGAAAAGCAACACAAT 
AGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTCAGCTTAAGCGATGCTGTT 
CAAGCAAGCTCTTACAATCGAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCATGGATATGATGCAA 
ACCTCAATGTCCTAGATAAAGATTTTAATCTAAAGTTAACAATGATAGAATCTAAAATAATTTTTAACAATCTCTA 
A 

t805.nt 

TGTCAAACTTGTGGAAAACTAATTGCAAACGGAGACGAAGTTTATATTGCAGAAGATGGATTATTCCATAGCGTGA 
AAAGCAACACAATAGCTGGATCAACACTCACAATGATACAAGGTCTTAAAAATTTAATAGAATTTGGTTTCAGCTT 
AAGCGATGCTGTTCAAGCAAGCTCTTACAATCCAACAAGAATTCTCAATATTGATAAAAAGGGCTTAATATGTCAT 
GGAT ATGATGC AAACC TC AATGTC CT AGAT AAAG ATTTTAATC T AAAGTTAACAATGAT AG AATC T AAAATAATTT 
TT AACAATC TCT AA ' . 

f635.aa 

MKILWLIILVNLFLSCGNESKEKSNLGLRLRELEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFNLIG 
XEFFKLGQYGPAIEYFAKNLEINPNNYLSHFYIGVASYNLAI^ 
AISNMYVYDLDKQLEAKNYLNKLGDMGEDYFE 

LK * 
t635.aa 
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CGNESKEKSNLGLRLRELEISGGGSESKIEVYKEFIEKEDKNILKIVNSIDKKARFFNLIGLEFFKLGQYGPAIEY 
FAKNLEINPNNYLSHFYIGVASYNLAKN^ 

AKNYLNKLGDMGEDYFEFLMLRGANYYSLGDLGNAILFYDKASKKASTEEQKEGVSRIMSNLK 
f635.nt 

ATGAAAATTTTGTGGTTAATAATTCTTGTTAATTTATTTTTATCTTGTGGCAATGAATCTAAAGAAAAATCAAATC 
TTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTGAATCTAAGATTGAAGTT.TATAAAGAATTTAT 
TGAAAAAGAAGATAAGAATATTTTAAAGATAGTTAATTCCATTGATAAGAAAGCCAGATTTTTTAATTTAATTGGT 
CTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATATTTTGCTAAAAATTTAGAAATCAATCCCAATA 
ATTATTTATCTCATTTTTATATAGGTGTTGCTTCTTATAATTTAGCTAAAAATTTAAGAGTAAAAGATGAAGTTGA 
AAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTCAATTAGAGATGATTTTAAAGATTCTCTTTTT 
GCCATTTCTAATATGTACGTATATGATCTTGATAAACAACTTGAAGCTAAAAATTATTTAAATAAACTTGGTGATA 

.TGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATTATTATTCGCTGGGCGATCTTGGTAATGGTAT 
ATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCAAAAAGAAGGTGTTTCTAGGATCATGAGTAAT 

' TTGAAGTAA * 

t63 5 .nt 

TGTGGCAATGAATCTAAAGAAAAATCAAATCTTGGTCTTAGATTAAGAGAATTGGAAATTTCAGGTGGTGGATCTG • 
AATCTAAGATTGAAGTTTATAAAGAA r TTTATTGAAAAAGAAGATAAGA^ 

TAAGAAAGCCAGATTTTTTAATTTAATTGGTCTTGAATTTTTTAAGCTTGGTCAGTACGGACCTGCTATTGAATAT 
TTTGCTAAAAATTTAGAAATCAATCCCAATAATTATTTATCTCATTTTTATATAGGTGTTGCTTCTTATAATTTAG 
CTAAAAATTTAAGAGTAAAAGATGAAGTTGAAAAATACATAATTCTTGCTGAAAATTCTTTTTTAAAATCACTTTC 
AATTAGAGATGATTTTAAAGATTCTCTTTTTGCCATTTCTAATATGTACGTATATGATCTTGATAAACAACTTGAA- 
GCTAAAAATTATTTAAATAAACTTGGTGATATGGGTGAGGACTATTTTGAGTTTTTAATGTTAAGAGGTGCAAATT 
ATTATTCGCTGGGCGATCTTGGTAATGCTATATTGTTTTATGATAAAGCTAGTAAAAAGGCTTCAACTGAAGAGCA 
AAAAGAAGGTGTTTCTAGGATCATGAGTAATTTGAAGTAA . , . 

f314 .aa 

MNNCLIKFFIFLLVFSNSYVAFSKNVKV^ 

CGVGKWAGVWTSYILSKYNISHVINSGVAGGWSAKYKDIKV^ 

FNANKNLIKNAIEAIKSKVGGSNAYSGLIVSGDQFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIR 
. SISDIVNKEGNEVEYSKFSKIAAFNSAKWQEILRKLZ . 

. t314 .aa - - . 

KNVNVLIVTAMDSEFDQINKLMSNKEEI 

VINSGVAGGWSAKYKDIKVGDVWSSEVAYHDVDLTKFGYKVGQLTGGLPQKFNANKNLIKNAIEAIKSKVGGSN 
AYSGLIVSGDQFIDPTYINKIIGNFKDVIAVEMEGAAIGHVSHMFNIPFIVIRSISDIVNKEGNEVEYSKFSKIAA 
FNS AKWQE I LRKL Z 

f.314.nt 

ATGAATAATTGTTTAATAAAGTTTTTTATTTTTTTATTAGTTTTTTCAAACAGTTATGTTGCT.TTTTGTAAAAATG 
TCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTGATCAGATAAATAAGCTTATGTCTAATAAGGAAGAAAT 
AGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGAAGTTGTCTAATCGCAATGTTATGGTTATTATT 
TGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACATTTTGTCAAAATACAACATAAGTCATGTCATTA 
ATTCTGGCGTTGCTGGTGGCGTTGTTAGTGCTAAATACAAAGATATTAAAGTGGGAGA.TGTGGTGGTGTCTTCAGA 
GGTTGCATATCATGATGTTGATTTGACTAAATTTGGATACAAGGTAGGACAGCTTACAGGAGGATTGCCTCAAAAA 
TTTAATGCCAATAAAAATTTAATTAAGAATGCCATAGAGGCCATTAAATCAAAGGTTGGAGGTTCTAATGCATATT 
C AGGATTAAT AGTTTC AGG AGATC AGTTT ATTG ATCC AAC TTAT ATT AJ^C AAAATT ATAGGAAAC TTT AAAGATGT 
AATAGCTGTTGAGATGGAJ\GGTGCAGCAJVTAGGGCATGTTTCTCATATGTTTAATATACCTTTTATAGTTATTAGG" 
TCAATATCTGACATTGTAAJVTAAAGAAGGGAATGAGGTTGAATATAGTAAATTTTCTAAAATAGCTGCTTTCAATT 
CAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 
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t314.nt 

AAAAATGTCAATGTTTTAATAGTAACTGCTATGGACTCTGAGTTTGATCAGATAAATAAGCTTATGTCTAATAAGG 
AAGAAATAGTTCTTAAGGAGTATGGTCTTAATAAAAAGATTTTAAAGGGGAAGTTGTCTAATCGCAATGTTATGGT 
TATTATTTGTGGGGTTGGTAAGGTTAATGCTGGTGTGTGGACTAGCTACA.TTTTGTCAAAATACAACATAAGTCAT 
GTCATTAATTCTGGCGTTGCTGGTGGCGTTGTTAGTGCTAAATACAAAGATATTAAAGTGGGAGATGTGGTGGTGT 
CTTCAGAGGTTGCATATCATGATGTTGATTTGACTAAATTTGGATACAAGGTAGGACAGCTTACAGGAGGATTGCC 
TCAAAAATTTAATGCCAATAAAAATTTAATTAAGAATGCCATAGAGGCCATTAAATCAAAGGTTGGAGGTTCTAAT 
GCATATTCAGGATTAATAGTTTCAGGAGATCAGTTTATTGATCCAACTTATATTAACAAAATTATAGGAAACTTTA 
AAGATGTAATAGCTGTTGAGATGGAAGGTGCAGCAATAGGGCATGTTTCTCATATGTTTAATATACCTTTTATAGT 
TATTAGGTCAATATCTGACATTGTAAATAAAGAAGGGAATGAGGTTGAATATAGTAAATTTTCTAAAATAGCTGCT 
TTCAATTCAGCCAAAGTTGTACAAGAAATTTTAAGAAAACTTTAA 

f32.aa 

MOTKTLYLISLILLACNKNNKIPLTQKLDLP 

KIEKTLEKTERYGIEGNWILWYKGTKRYIFSKDINIVNNLIIDHSKZ 
. t32 .aa ■ ' ' . t 

CNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEWKIEKTLEKTERYGIE 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSKZ * 

f32.nt 

ATGAATACAAAAACATTATATTTAATATCCTTT^ATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAAGAATTTAATAATTGATCATTCTAAATAG 

t32.nt 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTA 
CGCiL^TTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GG AAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 
TAAT AATTGATC ATTC TAAATAG - " 

f320.aa 

MKSIYALLFLFINLSLLAJNNISKKDLEVLLKIAQAMN^ 

PENYKIPDLVNIDDFEDLKNLGAKTIKVRKILIEDLIRLIKDAKKFGIEIKIKSAYRTQEYQKFLFDYNVKTYGRK 
VAETQSAIPGHSQHHMGTAIDFINIDDNLLNTKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPWHYLYIGPKPC 
F I QKK YFNNLQHKL LEF WNQNKTNL I NL I EK YANZ 

t320.aa 

NNI SKKDLEVLLKI AQAMNKECKNF I EKNP IQFLKEIKPLVDAEKNNLLTLINKXI P I PENYKI PDLVNIDDFEDL 
KNLGAKTIKVRKI LI EDL I RL I KD AKKFG I E I KI KSAYRTQEYQKF LFD YNVKT YGRKVAETQS AI PGHSQHHMGT 
AIDFINIDDNLLNTKEGKWLYENSLKYGFSVSYPKGYETDTGYKAEPWHYLYIGPKPCFIQKKYFNNLQHKLLEFW 
NQNKTNLINLIEKYANZ * 



f320.nt 



158 

TABLE 1. Nucleotide and Amino Acid Sequences 



ATGAAATCAATTTATGCTTTATTATTTCTATTTATTAATTTATCTTTGTTGGCTAACAACATTTCAAAAAAAGATT 
TAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTATTGAAAAAAATCCTATTCAGTT 
C TT AAAAGAAATAAAAC CC TTAGTAGATGC AG AAAAAAAT AAC CTC TTAAC TC TAA.TAAAT AAAAAAAT AC CAATT 
CCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTTAAAAATCTTGGAGCAAAGACTA 
TTAAAGTAAGAAAAATATTAATCGAAGATTTAATTCGACTAATAAAAGATGCAAAAAAATTTGGGA.TTGAAATTAA 
AATCAAATCTGCTTACAGAACGCAAGAATATCAAJJLAATTTTTATTTGATTACAATGTCAAAACTTATGGCAGAA^ 
GTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACAGCAATAGATTTTATAAATATAG 
ATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAAAATACGGATTTTCCGTTTCATA 
CCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTTATACATAGGACCTAAGCCATGC 
TTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGGAACCAGAACAAAACAAATCTTA 
TTAACCTAATTGAAAAATATGCAAACTAA 

t320.nt 

AACAACATTTCAAAAAAAGATTTAGAAGTACTGCTAAAGATTGCCCAAGCAATGAATAAGGAATGCAAAAATTTTA 
TTGAAAAAAATCCTATTCAGTTCTTAAAAGAAATAiyVACCCTTAGT AGATGC AG AAAAAAAT AACCTCT^ 
AATAAAT AAAAAAAT ACCAATTCCTGAAAATTATAAAATACCTGATCTGGTAAATATTGATGATTTTGAAGATCTT 
AAAAATCTTGGAGCAAAGACTATTAAAGTAAGAAAAATATTAATCGAAGATTTAATTCGACTAATAAAAGATGC^ 
AAAAATTTGGGATTGAAATTAAAATCAAATCTGCTTACAGAACGCAAGAATATCAAAAAT^ 

TGTCAAAACTTATGGCAGAAAAGTTGCAGAAACCCAATCAGCAATTCCAGGCCATTCTCAACATCATATGGGAACA 
GCAATAGATTTTATAAATATAGATGATAATTTACTAAACACAAAAGAAGGAAAATGGCTTTATGAAAACTCTCTAA 
AATACGGATTTTCCGTTTCATACCCAAAAGGATATGAAACGGACACTGGATATAAAGCAGAGCCTTGGCACTACTT 
ATACATAGGACCTAAGCCATGCTTTATTCAGAAAAAATATTTTAATAATTTACAACATAAGCTTCTTGAATTTTGG 
AACCAGAACAAAACAAATCTTATTAACCTAATTGAAAAATATGCAAACTAA 

£342.aa 

MLYLGDNKAMRTKIIIMTIIILLAPISGFSNSKESARGKFGAGIILPLPXALQINIGNFDLDIGLYSGVNNLFSDW 
KTLFIALDYIFYIYTFPGAANILDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLALNIAVFRKKFDIFLRIA 
PGLGMNVWSNGVGFRWEVFAGLGLRFWFTZ 

t342.aa 

I^PISGFSNSKESARGKFGAGIILPLPIALQINIGNFDLDIGLYSGVNNLFSDWKTLFIALDYIFYIYTFPGAAN 
LDFSVGAGGYGTIWFSRFGGSKSGSGPMSIGARLPLALNIAVFRKKFDIFLRIAPGLGMIWWSNGVGFRWEVFAGL 
GLRFWFTZ ' " * ' - 

f342 .nt • 

ATGCTATACTTAGGAGATAATAAAGCAATGAGAACAAAAATAATTATTATGACAATTATTATTTTATTAGCCCCAA 
TCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATTATACTTCCATTACCAATTGC 
TCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAATAATTTGTTTTCAGACTGG 
AAAACATTATTTATAGCATTAGACTATATTTTCTACATATACACATTCCCGGGAGCTGCTAATATTTTGGATTTTT 
CAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAGGCAGTAAGTCAGGCTCAGGACCAATGAG 
CATTGGAGCAAGA.TTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGACATATTTTTACGAATAGCA 
CCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTGGGACTAAGAT 
TCTGGTTTACTTAA 

t342.nt 

TTAGCCCCAATCTCAGGATTTTCTAATTCAAAAGAATCTGCAAGGGGTAAATTTGGAGCAGGAATTATACTTCCA.T 
TACCAATTGCTCTACAGATTAATATAGGAAACTTTGATCTTGACATTGGTCTTTACAGCGGAGTAAATAATTTGTT 
TTCAGACTGGAAAACATTA.TTTATAGCATTAGACTATATyTTCTACATATACACATTCCCGGGAGCTGCTAATATT 
TTGGATTTTTCAGTTGGCGCAGGGGGATATGGAACAATATGGTTTTCAAGATTTGGAGGCAGTAAGTCAGGCTCAG 
GACCAATGAGCATTGGAGCAAGATTGCCTTTGGCCTTAAATATTGCAGTATTTAGGAAGAAATTCGACATATTTTT 
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ACGAATAGCACCCGGACTTGGAATGAATGTTTGGAGTAATGGCGTTGGATTTAGATGGGAAGTATTCGCAGGATTG 
GGACTAAGATTCTGGTTTACTTAA 

f352.aa 

MNKTKNRSLTYFIILSCISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDL 
KLPENIRDKKLPQKRMDENDLKSVIENYENKIKN^ 

EIKKIiLNKKIKPKEDENYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ 
t352 . aa * ■ 

CISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDLKLPENIRDKKLPQKRM 

DENDLKSVIENYENKIKNIEKLLKTKNQKTSEN 

NYEKINIENIEEETDDDFEDNYEYNDEIEEQMRTITLLMKEZ 

f352.nt ' . . ^ 

ATGAATAAAACAAAAAATCGAAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 

ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTAAAAGTTCTGGGAATAAAAG 

CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAAAG 

AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAAAAGAATGGACGAAAATGATCTAAAATCTGTAATTC 

AAAATT ATGAAAATAAAATTAAAAAC AT AGAAAAGCTTTTA 

TAAAAAAATAGAATCAATCGAAAAAAAAGCAAAAAAAT^ 

GAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAGAT^ 

TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAGGACAAT 
TACCCTTCTAATGAAGGAATAA 

t352.nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 

AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTA 

TGAAGACCCAAAAAAGGGTAAAGATCTAAAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAA 

G ACG AAAATG ATC T AAAATC TGTAATTG AAAATT ATGAAAAT AAAATT AAAAAC AT AG AAAAGC TTTT AAAAAC C A 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATC 

AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCA^ 

AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 
ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATGAAGGAATAA 

f301.aa. 

MQIDGKIYSIISFPVRDSVSTLGVIGILICFDESLDIIENQLYSSLKFGSKNYNFFMLDRNYMPIF 

FSTAYSENFLSKVIAYAKKDSSSSQYTFNYERDFYSUSIFVKTDDFLTQGLILNVNSIPIMFKSNWIFVAFLLLSF 

AIIFYLCNTFVFSLINDFNRIVDYQKSKSDPFSLESPLEVKYSSSIISYISSKLDNLSSKSNESFEKIKFY3EDLN 

EYLEQIETAISNTESIDSSILVYEQLRDTFSRFEKSIVDILKGFESIADPINDHNKYISEliSSNFEESVSFFYSID 

KNLEIFNKVATINSTDIENIKSKVFDLNIVFENVNKNFADLLSQTNSLQSVNKLLVSISAQTNM 

GDAGKSFAWAEEIRKUUNSGKYSKTIKDELKTVDSII^^ 

HFKEIGEFKERYLSHDTKIRDAKNMYKEIFN^ 

ILKTKELIQKINDEIKDILFZ 

t301.aa 

CFDESLDIIENQLYSSLKFGSKNYNFFML^^ 
YERDFYSLNFVTCTDDFLTQGLILNVNSIPIMFKSNWVIFV^^ 

DPFSLESPLEVKYSSSIISYISSKLDNL3SKSNESFEKIKFYSEDLNEYLEQIETAISNTESIDSSILWSQLRDT 
FSRFEKSIVT)ILKGFESiADPINDHNKYISEISSNFSESVSFFY5IDKNLEIFNKVATINSTDIENIKSKVFDLNI 
VFENVNKNFADLLSQTNSLQSVNKLiL VS I S AQTNMLAMNAAI EAAKAGDAGKS FAWAEEI RKLAINSGKYS KT I K 
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TABLE 1 . Nucleotide and Amino Acid Sequences 

DELKTVDSIIAVINSEIDTIYKNFIDIQDNV^ 

FNNHYFISGKFNNFSQDLKEFKVSKMNLDAVSSLQEYSSLVKSSKDKILKTKELXQKINDEIKDILFZ • • 

f301.nt 

ATGCAAATAGATGGGAAAATTTATTCTATAATAAGTTTTCCAGTTAGAGATTCTGTTTCAACATTGGGTGTGATAG 
GGATTTTAATATGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAA 
AAATTATAATTTTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATC.TTCAGGCCAAATCT 
TTTTCTACAGCTTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGT 
ACACTTTTAATTATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTAT 
TTTAAATGTCAATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTT 
GCAATTATTTTTTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAA 
AATCAAAAAGCGATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAG 
TTCAAAGCTAGATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAAT 
GAATATTTGGAACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAAC 
TAAGAGATACTTTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGAT 
TAATGATCACAATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGAT 
AAAAATTTAGAAATTTTTAJ^TAAGGTTGCTACTATAAATTCTACTGATATTGAAAATATTAJ^AAGTAAGGTTTTTG 
ATTTAAATATTGTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGT 
AAATAAACTTTTAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCA 
GGTGATGCAGGTAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTA 
AAACCATTAAAGATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAA 
AAATTTCATAGACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAG 
CATTTTAAAGAAATTGGGGAGTTTAAAGAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGT 
ATAAAGAAATATTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAA 
AGTTTCTAAGATGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAG 
ATATTAAAGACAAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG 

t301 .nt. 

TGCTTTGATGAGTCGTTAGATATTATTGAAAATCAGTTGTATTCTTCTCTTAAATTTGGTAGTAAAAATTATAATT^ 
TTTTTATGCTTGACAGAAATTACATGCCCATTTTTTCAAACCTTAATAATCTTCAGGCCAAATCTTTTTCTACAGC 
TTATAGTGAGAATTTTTTGAGTAAAGTTATAGCTTATGCTAAAAAAGATTCTTCTAGCTCTCAGTACACTTTTAAT 
TATGAAAGAGATTTTTATTCTTTAAACTTTGTAAAAACCGATGATTTTTTGACTCAGGGGCTTATTTTAAATGTGA 
ATTCCATTCCTATTATGTTTAAATCAAATTGGGTTATATTTGTTGCATTTTTATTATTGTCTTTTGCAATTATTTT 
TTATTTATGCAATACTTTTGTTTTTTCATTAATTAATGATTTTAACAGAATTGTTGACTATCAAAAATCAAAAAGC 
GATCCTTTTAGTCTTGAATCTCCCTTAGAGGTTAAGTATTCTTCATCTATTATTTCTTATATTAGTTCAAAGCTAG 
ATAATCTGTCTTCTAAGAGTAATGAATCTTTTGAGAAGATAAAATTTTATTCTGAAGATTTGAATGAATATTTGGA 
ACAAATAGAAACTGCTATATCAAATACTGAGAGTATAGATTCTAGCATTTTAGTTTACGAACAACTAAGAGATACT 
TTTTCTAGATTTGAAAAATCAATTGTTGATATTTTAAAAGGCTTTGAATCTATTGCTGATCCGATTAATGATCACA 
ATAAATATATATCAGAAATCTCTTCAAATTTTGAAGAGAGTGTTAGTTTTTTCTATAGTATAGATAAAAATTTAGA 
•AATTTTTAATAAGGTTGCTACTATAAATTCTACTGATATTGAJ\AATATTAAAAGTAAGGTTTTTGATTTAAATATT 
GTTTTTGAAAATGTGAATAAAAATTTTGCAGATCTTTTGTCTCAAACAAATAGTTTGCAAAGTGTAAATAAACTTT 
TAGTTTCAATTTCAGCTCAGACCAATATGCTTGCTATGAATGCAGCAATTGAAGCAGCAAAAGCAGGTGATGCAGG 
TAAAAGTTTTGCAGTTGTTGCTGAGGAGATTAGAAAGCTTGCTATTAATTCTGGAAAATATTCTAAAACCATTAAA 
GATGAACTTAAAACGGTCGACAGCATTATTGCAGTAATTAATTCAGAGATTGATACAATTTATAAAAATTTCATAG 
ACATTCAAGATAATGTGGACAACAATTTTTCAAGACACGAGAAAGTAGATCTTACTCTTGCTAAGCATTTTAAAGA 
AATTGGCGA.GTTTAAJ^GAAAGGTATTTGTCTCACGATACTAAGATCAGAGATGCTAAGAATATGTATAAAGAAATA 
TTTAATAATCATTATTTTATTAGTGGCAAGTTTAACAACTTTAGTCAAGATTTAAAAGAGTTTAAAGTTTCTAAGA 
TGAATTTAGATGCGGTAAGTTCTCTTCAAGAATATTCATCTTTAGTAAAGTCTTCTAAGGATAAGATATTAAAGAC 
AAAGGAATTGATTCAAAAGATTAATGATGAGATTAAAGATATTCTTTTTTAG ' - " 



f346 .aa 
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TABLE 1. Nucleotide and Amino Acid Sequences 

MSIDKVPDEAFAEKIVGDGIAILPTSNELLAPCDGKIGKIFKTNHAFSLETKEGVEIFVHFGINTLNLNGKGFTRV 
AEEGINVKQGEVIIRLDLEYLKEHSESVITPWIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTK 
PVIAGKDLVLRVKKZ 

t3 46.aa - .• 

■ CDGKIGKIFKTNHAFSLETKEGVEIFVHFGINTLNLNGKGFTRVAEEGINVKQGEVIIRLDLEYLKEHSESVITPV 
VIANSDEVSSIEYSFGRLENDSEYILSSSTVLTEEIRHKISQTKPVIAGKDLVLRVKKZ 

f346.nt ' 

■ ATGTCAATTGATAAGGTTCCCGATGAAGCTTTTGCTGAAAAAATAGTTGGCGATGGAATTGCAATTCTTCCAACAA 
GCAATGAGTTGTTGGCGCCTTGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAAC 
TAAAGAGGGCGTTGAAATTTTTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTT 
GCTGAAGAGGGCATTAATGTTAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAG 
AATCCGTTATTACTCCGGTTGTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGA 
AAATGATTCTGAATATATTTTATCATCTTCAAGTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAG 
CCTGTTATAGCGGGCAAAGATTTGGTGTTGCGAGTTAAAAAGTAA 

t346 .nt - 

TGTGATGGGAAAATAGGTAAAATTTTTAAAACCAATCATGCCTTTAGCCTTGAAACTAAAGAGGGCGTTGAAATTT 
TTGTCCATTTTGGAATTAATACTCTTAATTTAAATGGTAAGGGTTTTACAAGAGTTGCTGAAGAGGGCATTAATGT 
. TAAACAAGGTGAAGTTATTATTAGGCTTGATCTTGAATATTTAAAAGAGCATTCAGAATCCGTTATTACTCCGGTT 
GTTATTGCAAATTCTGATGAAGTTTCAAGTATAGAATATTCTTTTGGAAGGCTTGAAAATGATTCTGAATATATTT 
TATCATCTTCAACTGTCTTGACAGAAGAAATTAGGCATAAAATATCTCAAACAAAGCCTGTTATAGCGGGCAAAGA 
TTTGGTGTTGCGAGTTAAAAAGTAA 

-f 373 .aa 

MNYQRIKNYCKFTSVFLFFLFSCVSNELKLDQSLVKGKLWGLRYYIYKNQTPKNAVNM 

IAHYLEHMAFNGTKDYPGNSIVDVLKKFGMQFGADINAATSFDFTYYRLDLSDGNNKDEIDESINILRNWASQISF 

MKEEIDLERNIIIEEKKLGETYPGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKWYRPELASVI 

WGDIDPIEIEEKIKKQFVSWKNPTDKIKEVKVSLDVELKDKFLLLEDLEVGEP'SLMFFKKEIINFVKTKDDLLNA 

IKKSLLAALFENRFSELKTAGVKQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHLNEGIQDFFYELERIRKFG 

TQGELEKVRSQFYKSLELRKKNINKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTINNLVGREFD 

VKNCAIFYSYHGRAHPVLTLEDIDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVLENGV 

EVYFKYNDQKKGVIDFSATSWGGLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESY 

ISGSSDKKDLETLFQLIYFTFKEPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAISKFLNNNDPRFEDTKDSDL 

QYFTKENILSFYKKRFTYANNFKFVLLETQIFRQZ 

t373 .aa - * ' . 

CVSNELKLDQJSLVKGKLVNGLRYYIYKNQT^ 

.DVLKXFGMQFGADINAATSFDFTYYRLDLSDGNNKDEIDESINILRNWASQISFMKEE 

PGRIYEKMDKFLTSGSLYEFRSPIGLEEQILSFQPEDFKKFYRKV^PELASVIWGDIDPIEIEEKIKKQFVSWK 
NPTDKIKEVKVSLDVELKDKFLLLEDLEVGEPSLMFFKKEIINFVKTKDDLLNAIKKSLLAALFENRFSELKTAGV 
KQFKNVSNKDFFSFKSDNNTIVAKSISLNFNPDHLNEGIQDFFYELERIRKFGFTQGELEKVRSQFYKSLELRKKN 
INKTNSWAIFQDLIEIAINGSNKFDMNEYCDLSFQYLEKIDLKTINNLVGREFDVKNCAIFYSYKGRAHPVLTLED 
IDNLQKIALKRELKPYENSLIEGKFFKKSLDDKDIIRENEFENEISSFVLENGVEVYFKYNDQKKGVIDFSATSWG 
GLINEDLKLIPVLSFAPGWSGSGYGDYSALQIEKYLSDKAVSLRVGVGAQESYISGSSDKKDLETLFQLIYFTFK 
EPKIDDVSLQNAINNIKALIKSNENSSDYHFHKAISKFLNNNDPRFEDTKDSDLQYFTKENILSFYKKRFTYANNF- 
KFVLLETQIFRQZ 



f373 .nt 
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TABLE 1. Nucleotide and Amino Acid Sequences 

ATGAATTATCAAAGAATTAAGAATTATTGTAAATTTACAAGCGTTTTTCTATTTTTTTTGTTTTCCTGTGTTTCTA 

ATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTTATAAAAATCA 

AACCCCAAAGAATGCCGTTAATATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAATGAGAGGGGA 

ATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTTGATGTTCTTA 

AAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATAGACTTGATTT 

GTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCAAATCAGTTTC 

ATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGA.GGAAAAAAAGCTTGGTGAGACTTATCCTGGAAGAA 

TTTATGAGAAAATGGATAAGTTTTTGACAAGCGGAAGTCTTTATGAATTTAGAAGTCCTATTGGACTTGAAGAGCA 

AATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACTTGCAAGTGTTATT 

GTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAAAATCCAACCG 

ATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAGATTTGGAAGT 

TGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATGACCTTTTAAATGCT 

ATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCTGGGGTAAAGCAATTTA 

AAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGATTTCTTTAAA 

CTTTAATCGAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGiy^AATTTGGATTT 

ACCCAAGGTGAGCTTGAAAAAGTTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAATATAAATAAAA 

CAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATATGAATGAATA 

TTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAGAGAGTTTGAT 

GTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTGAAGATATTGACAATC 

TTCAAAAGATAGCTTTAAAAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTGAAGGTAAATTTTTTAAGAAGTC 

TTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTGAAAATGAAATTTCGTCATTTGTTCTTGAAAATGGGGTT 

GAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGCAACTTCTTGGGGAGGTTTAATTA 

ATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCTGGTTCGGGTTATGGTGATTATTC 

TGCATTACAGATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGAGTTGGGGTTGGAGCTCAAGAATCATAT 

ATTTCTGGAAGTTCAGATAAAAAAGATCTTGAAACTCTTTTTCAGCTTATATATTTTACTTTTAAGGAACCCAAAA 

TTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAATAGTTCTGATTA 

TCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGATAGTGATTTG 

CAATATTTTACAAAAGAAAATATTTTGTCTTTTTAT 

TGCTGGAGACTCAGATATTCAGACAATAA 

t373 !nt 

TGTGTTTCTAATGAGTTAAAGTTAGATCAAAGTTTGGTAAAAGGAAAACTTGTCAATGGGCTAAGGTATTATATTT 
ATAAAAATCAAACCCCAAAGAATGCCGTT^TATGGGAATTGTTTTTAATGTGGGCTCACTTAATGAAGAAGATAA 
TGAGAGGGGAATAGCGCATTATCTTGAACATATGGCTTTTAATGGTACAAAAGATTATCCAGGGAATTCTATAGTT 
GATGTTCTTAAAAAATTTGGAATGCAATTTGGTGCTGACATTAATGCTGCTACTAGTTTTGATTTCACTTATTATA 
GACTTGATTTGTCAGATGGTAATAATAAAGATGAAATTGATGAATCTATAAATATTTTGAGAAACTGGGCTTCTCA 
AATCAGTTTCATGAAAGAAGAAATAGATCTAGAGCGAAATATTATTATTGAGGAAAAAAAGCTTGGTGAGACTTAT 
CC TGGAAGAATTT ATGAGAAAATGGATAAGTTTTTGAC AAGC GGAAGTC TTT ATG AATTTAGAAGTC C TATTGGAC 
TTGAAGAGCAAATTTTATCTTTTCAGCCAGAAGATTTTAAAAAATTTTATAGAAAGTGGTATAGGCCAGAACTTGC 
AAGTGTTATTGTGGTAGGAGATATTGATCCTATAGAAATTGAAGAGAAGATAAAGAAGCAATTTGTTTCTTGGAAA 
AATCCAACCGATAAAATTAAAGAAGTAAAAGTAAGTTTAGACGTAGAGCTTAAGGATAAATTTTTACTTTTAGAAG 
ATTTGGAAGTTGGAGAGCCTAGTTTAATGTTCTTTAAAAAGGAAATTATTAACTTTGTAAAGACCAAAGATGACCT 
TTTAAATGCTATTAAAAAGTCTTTATTAGCCGCTCTTTTTGAAAATAGATTTTCTGAATTAAAGACTGCTGGGGTA 
AAGCAATTTAAAAATGTTTCAAATAAAGATTTTTTCTCATTTAAATCAGATAACAATACCATTGTTGCAAAATCGA 
TTTCTTTAAACTTTAATCCAGATCATTTGAACGAAGGAATACAAGACTTTTTTTATGAGCTTGAGAGGATAAGAAA 
ATTTGGATTTACCCAAGGTGAGCTTGAAAAAG.TTAGATCTCAATTTTACAAATCTTTAGAATTAAGGAAAAAGAAT 
ATAAATAAAACAAATTCATGGGCTATTTTTCAGGATTTAATAGAAATTGCTATTAATGGTTCTAATAAATTTGATA 
TGAATGAATATTGCGATCTTTCTTTTCAATATTTGGAAAAGATTGATTTAAAAACAATAAACAATCTTGTAGGAAG 
AGAGTTTGATGTAAAAAATTGTGCAATTTTTTATTCTTACCATGGAAGAGCACATCCTGTTTTAACTCTTGAAGAT 
ATTGACAATCTTCAAAAGATAGCTTTAJ^AAAGAGAGTTAAAGCCTTATGAGAATTCTTTAATTGAAGGTAAATTTT 
TTAAGAAGTCTTTAGATGATAAAGATATTATTAGAGAAAATGAGTTTGAAAATGAAATTTCGTCATTTGTTCTTGA 
AAATGGGGTTGAAGTTTATTTTAAATATAATGATCAAAAAAAAGGTGTAATTGATTTTAGTGGAACTTCTTGGGGA 
GGTTTAATTAATGAAGATTTAAAACTTATTCCTGTTTTATCTTTTGCTCCCGGAGTAGTATCTGGTTCGGGTTATG 
GTGATTATTCTGCATTACA-GATTGAAAAATATTTATCAGATAAAGCTGTTTCTTTAAGA.GTTGGGGTTGGAGCTCA 
AGAATCATATATTTCTGGAAGTTCAGATi^AAAAAGATCTTGAAACTCTTTTTCAGCTTATATATTTTACTTTTAA.G 
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GAACCCAAAATTGATGATGTTTCTTTGCAAAATGCTATTAATAATATAAAAGCATTAATAAAGAGCAATGAAAATA 
GTTCTGATTATCATTTTCATAAAGCCATTAGTAAATTTTTAAACAATAATGATCCTAGATTTGAAGATACAAAAGA 
TAGTGATTTGCAATATTTTACAAAAGAAAATATTTTGTCTTTTTATAAGAAAAGGTTTACTTATGCAAATAATTTT 
AAGTTTGTCTTGCTGGAGACTCAGATATTCAGACAATAA 

f 384 . aa 

MDWDFEKIIFLLNESTRLALSGCAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETISTYKEEYIKDA 
LISESTFIIDPIDGTSSFAAGLPSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFNK 
FIFDNSKCYNIHSLL^VSRSIIRLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIAACLAIGNKLGMV 
GEFYCGNKMTLDILDSMYILEPNNHKRWSLKDFFIYSDNKSTIDIIRKDANKKINK 

.t3 8 4 .aa 

XAKLILDFKSDGSIVTQVDKQIEQFLFKEIKKPGNFVLGEETISTYKEEYIKDALISESTFIIDPIDGTSSFAAGL 
PSYGISLAYASGGKIIEGAISLPLSGEFFITSKDNVFYAKKNIGSYPLKKDFMKFIFDNSKCYNIHSLLAVSRSII 
RLFNLDISSHIHINGSCVYSFAKLFTGSYKAYFSFVGLWDIA 
NNKKRWSLKDFFIYSDNKSTIDIIRKDANKKINKZ 

f384.nt . • , 

ATGGATTGGGATT5TTGAAAAAATTATATTTTTATTAAATGAATCAACTAGGCTTGCATTAAGTGGTTGTGCTAAAT 
TAATTTTAGATTTTAAATCTGATGGGTCTATTGTAACTCAGGTTGATAAGCAAATTGAGCAATTCTTATTCAAAGA 
GATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGAGTATATCAAAGATGCT 
TTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCAGGCCTTCCTTCATATG 
GAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTTTAAGGGGAGAGTTTTT 
. TATTACTTCTAAAGATAATGTATTTTATGCTAAAAAAAACATTGGTAGCTATCCTTTAAAAAAGGATTTTAATAAA. 
TTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCTATTATAAGGTTATTTA 
ATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCf AAACTTTTTACAGGTTCTTA 
TAAGGCCTACTTTTCTTTTGTAGGACTTTGG.GATATTGCAGCGTGTTTAGCTATTGGTAATAAATTGGGCATGGTT 
GGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTTTAGAGCCTAATAATCATA 
AAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTATAAGAAAAGATGCAAA 
TAAAAAAATCAATAAGTAA 

t384 .nt 

AGTGGTTGTGCTAAATTAATTTTAGATTTTAAATCTGATGGGTCTATTGTAACTCAGGTTGATAAGCAAATTGAGC 
AATTCTTATTCAAAGAGATCAAAAAGCCTGGAAATTTTGTTCTTGGAGAAGAGACAATATCTACTTATAAAGAAGA 
GTATATCAAAGATGCTTTAATATCAGAGAGTACTTTTATTATTGATCCTATTGATGGAACTTCTTCTTTTGCAGCA 
GGCCTTCCTTCATATGGAATATCGCTAGCGTATGCTAGTGGCGGCAAAATTATTGAAGGAGCCATTTCTCTTCCTT 
TAAGCGGAGAGTTTTTTATTACTTCTAAAGATAATGTATTTf ATGCTAAAAAAAACATTGGTAGCTATCCTTTAAA 
AAAGGATTTTAATAAATTTATTTTTGATAATTCTAAATGTTACAATATTCATAGTTTACTTGCAGTTTCAAGGTCT 
ATTATAAGGTTATTTAATCTTGATATTTCTTCTCATATTCATATTAATGGTTCTTGTGTATATTCTTTTGCTAAAC 
TTTTTACAGGTTCTTATAAGGCCTACTTTTCTTTTGTAGGACTTTGGGATATTGCAGCGTGTTTAGCTATTGGTAA 
TAAATTGGGCATGGTTGGCGAATTTTATTGTGGTAATAAAATGACATTAGATATCTTAGATTCAATGTATATTTTA 
GAGCCTAATAATCATAAAAGATGGTCCTTGAAAGATTTTTTTATTTATTCTGATAATAAATCAACAATAGACATTA 
T AAG AAAAG ATGC AAAT AAAAAAAT C AAT AAGT AA 

f 860 .aa • 

^^AFYKLNDNIALAEDLLKYLLSSIIi^^ECSQDMDFLENYIEKGLIKKLENVINSNFEVITYTKAIEILENSKKNFEI 
KPYWGIDLQTDHERYLTEETFKKPVWIDYPKNFKAFYMKANKDI^TVKGMDILVPKIGE 
RIKELNLNIEHLNWYLDLRRFGSAPHSGFGLGLERLVQYSTGISNIRDSIPFPRTPKNLYFZ • 



t860 .aa 
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CSQDMDFLE^^^IEKGLIKKLENVINSNFEVITYTKAIEILENSKKNFEIKPYWGIDLQTDHERYLTEETFKKPVVV 
IDYPKNFKAFYMKANKDNKTVKGMDILVPKIGEIIGGSEREDDLQKLENRIKELNLNIEHLNWYLDLRRFGSAPHS 

GFGLGLERLVQYSTGI SNIRDS I PF PRTPKNLYFZ 
- f 860 .nt 

ATGGCTTTTTATAAGCTTAACGACAATATTGCCCTAGCAGAAGATCTCTTGAAATATCTTTTAAGTTCAATTTTAA 
ACGAATGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAAGGTTTAATTAAAAAACTAGAAAATGTAAT 
AAATTCAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGAAAACTCAAAAAAAAATTTTGAA^ 
AAACCTTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAG . 
TGGTCATTGATTATCCAAAAAATTTCAAAGCATTTTACATGAA 

GGACATACTTGTTCCAAAAATTGGAGAGATTATAGGGGGAAGCGAAAGAGAAGATGACCTTCAAAAATTAGAAAAT 
AGAATAAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTC 
ATTCTGGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACC . 

ATTCCCAAGGACTCCTAAAAATCTTTATTTTTAA 
t860 .nt ■ 

TGCTCACAAGATATGGATTTTTTAGAAAATTACATTGAAAAAGGTTTAATTAAAAAACTAGAAAATGTAATAAAT^ 
CAAATTTTGAGGTTATTACCTATACTAAAGCAATTGAAATTCTTGAAAACTGAAAAAAAAA 

TTACTGGGGAATAGATTTGCAAACAGATCACGAAAGATACCTAACAGAAGAGACTTTTAAAAAACCGGTAGTGGTC 

attgattatccaAaaaatttcaaagcattttacatgaaagcaaataaagacaataaaactgttaaaggaatggaca' 

TAC TTGTTC C AAAAATTGGAGAG ATT AT AGGGGGAAGCG AAAG AG AAGATG ACC TTC AAAAATTAGAAAAT AGAAT 
AAAAGAATTAAACTTAAACATTGAACATCTAAACTGGTATCTTGATCTAAGAAGATTTGGCTCGGCTCCTCATTCT 
GGCTTTGGACTTGGACTTGAAAGATTGGTGCAATACTCAACAGGAATATCTAATATAAGAGATTCAATACCATTCC 

CAAGGACTCCTAAAAATCTTTATTTTTAA 
f'44 6-.aa 

" MKILRLCLLFLFFACTFDYDEYSSRSDVAKKFPSIQILGIKYYDVVYNKEQTVLNS-LSFSYFNDYKIYKAENGRFL 

YHSLDNEISGKFNNLEGSYITKDLDMRDSVE^ 
KGFSYFLKSNVFYFDSGVEGIMNZ . 

t446.aa 

CTFDYDEYS SRSDVAKKF PS I Q I LGI KYYDVVYNKEQTVLNS LS F S YFNDYKI YKAENGRFL YHS LDNE I SGKFNN 

LEGSYITKDLDMRDSVEFKIEDKOT^ 

DSGVEGIMNZ ' 

f446.nt * ' 

ATGAAAATACTTAGACTTTGTTTGTTGTTTTTGTTTTTTGCTTGTACTTTTGATTATGATGAGTATTCTAGTAGAT 
CTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAATCAAGTATTATGATGTTGTATACAATAAAGAGCA 

AAC CGTTTT AAATTCTTT AAGC TTTAGTTATTTC AATGAC TAT AAAATTT AT AAGGC AGAG AATGGAAGGTTTTT A 
TATCATTCCCTAGATAATGAAATTTCAGGGAAGTTTAATAATTTGGAAGGTTCTTATATTACAAAGGATTTGGATA 
TGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAATAATTATTATTTGCTTAATTCAAATAGGCTTTTATGGAA 
GAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGTATTAATTAGATTTAATGATAGCAAAATAAACGGA 
AAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTTGATTCTGGAGTTGAAGGAATCATGAATTGA 

t446 .nt 

TGTACTTTTGATTATGATGAGTATTCTAGTAGATCTGATGTGGCCAAAAAGTTTCCTTCAATACAAATATTAGGAA 
TCAAGTATTATGATGTTGTATACAATAAAGAGCAAACCGTTTT AAATTCTTT AAGC TTTAGTTATTTCAATGACT A 

TAAAATTT AT AAGGC AG AGAATGGAAGGTTTTTAT ATC ATTC C C T AG AT AATGAAATTTC AGGGAAGTTTAAT AAT 
TTGGAAGGTTCTTATATTACAAAGGATTTGGATATGAGAGATTCTGTAGAATTTAAAATAGAAGATAAAAATAATT 
ATTATTTGCTTAATTCAAATAGGCTTTTATGGAAGAATAAAGACAAGAAGTTGCAATCCCCCCCAAATGAGCTAGT 
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ATT'AATTAGATTTAATGATAGCAAAATAAACGGAAAAGGATTTTCTTATTTTTTAAAGAGCAATGTTTTTTATTTT 
GATTCTGGAGTTGAAGGAATCATGAATTGA . 

f457.aa - 

MKQKLSWILLFCFLSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPL 
FFNNLRYEIIGRKNISKGFEFEWIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDD 
FDYTEWHFFRWKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ 

t457.aa 

C p LSCRSESRLAENVLIEFFDSIKNFQSSPEIFFNYLNIPSDDDLKAKIRGLKSQAKDDFIFYPLFFNNLRYEIIG 
RKNISKGFEFEVVIKNINFQNGIEKFLAKLNKIEGRSLNIKNLEKKERKKIFDNLINEVIGELDDFDYTEVVHFFR 

WKSSSESYKIELLGDVLNIQSRNKLINDLFLVLSPGIZ . . . 
f457 .nt ' ■ ' 

ATGAAGCAAAAATTAAGTTGGATTTTATTATTTTGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATG 
TTTTAATAGAGTTTTTTGATTCTATTAAAAATTTTCAAAGCAGTCCTGAAATATTTTTTAATTATTTAAATATTCC 
AAGTGATGATGATCTGAAGGCAAAAATTCGTGGGTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTG 
TTTTTTAATAATCTAAGATATGAGATAATAGGTAGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTA 
AAAATATTAACTTTCAAAACGGTATAGAAAAATTTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTTAAATAT 
TAAAAATTTAGAAAAAAAAGAGCGTAAAAAAATATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGAT 
TTTGATTACACTGAAGTTGTTCATTTTTTTAGAGTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAG 
GAGATGTTTTAAATATACAGTCTAGAAATAAGCTTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 

t457.nt 

TGTTTTTTGTCTTGTAGATCTGAATCTAGATTGGCTGAAAATGTTTTAATAGAGTTTTTTGATTCTATTAAAAATT 
TTCAAAGCAGTCCTGAAATA'TTTTTTAATTATTTAAATATTCCAAGTGATGATGATCTGAAGGCAAAAATTCGTGG 
GTTGAAATCTCAGGCAAAGGATGATTTCATTTTTTATCCTTTGTTTTTTAATAATCTAAGATATGAGATAATAGGT 
AGAAAAAATATTTCTAAGGGCTTTGAATTTGAAGTTGTTATTAAAAATATTAACTTTCAAAACGGTATAGAAAAAT 
TTTTGGCTAAATTAAATAAAATTGAAGGGAGATCTTTAAATATTAAAAATTTAGAAAAAAAAGAGCGTAAAAAAAT 
ATTTGACAATTTAATAAATGAAGTTATTGGAGAGTTGGATGATTTTGATTACACTGAAGTTGTTCATTTTTTTAGA 
GTAGTTAAGAGTTCTTCTGAAAGTTATAAAATAGAGCTTTTAGGAGATGTTTTAAATATACAGTCTAGAAATAAGC 

TTATTAATGATCTTTTTTTGGTTTTATCGCCTGGAATTTAA 
f542.aa ' . 

^IVIFIPGILLTSCFSRNGIESSSKKIKISMLVDGVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSS 
YVSDLDNLKRNGSDLIWLVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQ 

KKLFWQNRFYRGNEGZ . " 

t542.aa ; 

"CFSRNGIESSSKKIKISMLVTCVLDDKSFNSSANEALLRLKKDFPENIEEVFSCAISGVYSSYVSDLDNLKRNGSD 
LIWLVGYMLTDASLLVSSENPKISYGIIDPIYGDDVQIPENLIAWFRVEPRCFFGWLYCSQKKLFWQNRFYRGNE 

GZ 

f542.nt 

ATGAGAATTGTAATTTTTATATTCGGTATTTTGTTGACTTCTTGCTTTAGTAGAAATGGAATAGAA'TCTAGTTCAA 
AAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACGACAAATCTTTTAATTCTAGTGCTAATGAGGCTTT 
ATTACGCT'TGAAAAAAGATTTTCCAGAAAATATTGAAGAAGTTTTTTCTTGTGCTATTTCTGGAGTTTATTCTAGT 
TATGTTTC^GATC^TG^TAATTTAAAAAGGAATGGCTCAGACTTGATTTGGCTTGTAGGGTACATGCTTACGGACG 
CATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATGGAATAATAGATCCCATTTATGGTGATGATGTTCA 
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GATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCCAAGGTGCTTTTTTGGCTGGCTATATTGCAGCCAA 
AAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAAGGGTAA 

t'542.nt • 

TGCTTTAGTAGAAATGGAATAGAATCTAGTTCAAAAAAAATTAAGATATCCATGTTGGTAGATGGTGTTCTTGACG 

ACAAATCTTTTAATTCTAGTGCTAATGAGGCTTTATTACGCTTGAAAAAAGATTTTCCAGAAAATATTGAAGAAGT 

TTTTTCTTGTGCTATTTCTGGAGTTTATTCTAGTTATGTTTCAGATCTTGATAATTTAAAAAGGAATGGCTCAGAC ■ 

TTGATTTGGCTTGTAGGGTACATGCTTACGGACGCATCTTTATTGGTTTCATCGGAGAATCCAAAAATTAGCTATG 

GAATAATAGATCCCATTTATGGTGATGATGTTCAGATTCCTGAAAACTTGATTGCTGTTGTTTTCAGAGTAGAGCC 

AAGGTGCTTTTTTGGCTGGCTATATTGCAGCCAAAAAAAGCTTTTCTGGCAAAATAGGTTTTATAGGGGGAATGAA 

GGGTAA - 

f 93 .aa . ; . 

MKRI LAMHDI SSMGRTS LTIC I P VI SSFNMQVC PFVTAVLSASTAYKKFEI VDLTDHLEKF INIWKEQNEHFDI L Y 
TGFLGSEKQQITIEKIIKLIKFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLN 
NKDDIIKAILNLDTKATVWTSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQA 
r LEKTTKAIHLIIKESIKENVSKKEGVRIENFLKNTFZ ' . - 

t93 .aa ' 

CIPVISSFNMQVCPFVTAVLSASTAYKKFEIVDLTDHLEKFINIWKEQNEHFDILYTGFLGSEKQQITIEKIIKLI 
KFEKIVIDPVFADDGEIYPIFDNKIISGFRKIIKYANIITPNITELEMLSKSSKLNNKDDIIKAILNLDTKATVW 
TSVKRGNLLGNICYNPKNKEYSEFFLEGLEQNFSGTGDLFTSLLIGYLEKFETEQALEKTTKAIHLIIKESIKENV 

S KKEGVRI ENF LKNTFZ 

f93.nt " ■ - 

ATGAAAAGAATTTTAGCAATGCATGATATTTCAAGCATGGGAAGAACATCTCTTACAATATGCATACCAGTAATAT 

CTTCGTTTAATATGCAAGTTTGTCCTTTTGTGACAGCTGTCCTTTCTGCTTCCACAGCTTATAAAAAATTTGAAAT 

AGTGGATTTAACCGATCATTTAGAAAAATTTATCAATATATGGAAAGAACAAAATGAGCACTTTGACATACTCTAT 

ACCGGATTTCTGGGAAGCGAAAAACAACAAATAACAATAGAGAAAATAATTAAATTAATAAAATTTGA^ 

TAATTGATCCTGTGTTTGCTGACGATGGAGAAATTTACCCTATATTTGATAATAAAATAATTAGTGGATTTAGAAA 

AATCATAAAGTACGCAAACATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCAAAAGCTCAAAAC^ 

AACAAAGATGATATCATAAAAGCAATATTAAATCTTGATACAAAAGCGACGGTAGTTGTTACAAGCGTTAAAAGGG 

GAAATCTCTTGGGAAACATTTGCTAeAATCCTAAAAACAAAGAATACTCGGAGTTTTTTTTAGAAGGATTAGAACA 

AAATTTCAGTGGAACAGGAGATTTATTTACCAGCTTACTTATAGGATATTTGGAAAAATTTGAAACAGAGCAAGCC 

TTAGAAAAAACAACAAAGGCTATTCACCTAATAATAAAAGAGTCAAT^ 

TCCGAATTGAAAATTTCTTAAAAAATACATTTTGA . % 
t93.nt 

TGCATACCAGTAATATCTTCGTTTAATATGCAAGTTTGTCCTTTTGTGACAGCTGTCCTTTCTGCTTCCACAGCTT 
AT AAAAAATTTGAAAT AGTGG ATT.T AACCGATC ATTT AGAAAAATTT ATC AATAT ATGGAAAGAAC AAAATGAGC A 
' C TTTGAC AT AC TCT AT ACCGGATTTC TGGGAAGCGAAAAAC AAC AAATAAC AAT AGAGAAAAT AATTAAATT AAT A 
AAATTTGAAAAAATTGTAATTGATCCTGTGTTTGCTGACGATGGAGAAATTTACCCTATATTTGATAATAAAATAA 
TTAGTGGATTTAGAAAAATCATAAAGTACGCAAA.CATAATAACACCCAATATCACAGAACTTGAAATGCTAAGCAA 
AAGCTCAAAACTTAACAACAAAGATGATATCATAAAAGCAATATTAAATCTTGATACAAAAGCGACGGTAGTTGTT 
AC AAGCGTTAAAAGGGGAAATCTCTTGGGAAACATTTGCTACAATCCTAAAAACAA-AGAATACTCGGAGTTTTTTT 
TAGAAGGATTAGAACAAAATTTCAGTGGAACAGGAGATTTATTTACCAGCTTACTTATAGGATATTTGGAAAAA.TT 
TGAAACAGAGCAAGCCTTAGAAAAAACAACAAAGGCTATTCACCTAATAATAAAAGAGTCAATTAAAGAAAATGTT 
TCAAAAAAAGAAGGGGTCCGAATTGAAAATTTCTTAAAAAATACATTTTGA 



f 105 . aa 
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MC-r YT XLLRQSINLKSLFPLSVLFFSCNWDTDFSVLEFKVANFNLNDDFSQGLLDSAYNILNRSFDLIIIKNLKN 
:^-^DLINNRVLFRAFKNAYFIDQGSGLSVSILSKRKINIKVLSVMQDSCDLKLGLLVDFKFENNHYGIVIYNLSK 
DFIKSIANLQISEQILYLK^QMDKLMFILDESEFVIFDLLIKNGFFSLINDSNYTSMLANKIDFRVFSNFFARVSL 

YSFMFVIADYLHSNYWENFPQKIVINZ 

clOS . aa • ' . ' 

CNV^/D^DFSVLEFKVANFNLNDDFSQGLLDSAYNILNRSFDLIIIKNLKNMWLDLINNRVLFRAFKNAYFIDQGS 
GT-SVSTLSKRKINIKVLSVMQDSCDLKLGLLVDFKFEN^YGIVIYNLSKDFIKSIANLQISEQILYLKAQMDKLM 
FILDESEFVIFDLLIKNGFFSLINDSNYTSMLANKIDFRVFSNFFARVSLYSFMFVIADYLHSNYWENFPQKIVI 

NZ ' • 

£105. nt , . • ' 

ATGGGCTTGTATTTGAAGTTGTTGAGACAAAGTATCAACTTGAAGAGTTTATTTCCGCTTAGTGTTTTATTTTTTT 

CCTG^AATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGATTTTTC 

TCAAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAAT 

AAAA* TGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATC AAGGTA ■ 

GTC-GCCTTTCTGTTAGCATTCTTTCTAAGCGCAAAATAAATATTAAAGT.TTTAAGTGTAATGCAAGATTCTTGCGA 

T-TTAAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAG 

GAT^TTATTAAAAGTATTGCCAATTTGCAAATTAGTGAACAAATTTTATATTTAAAAGCCCAAATGGATAAATTGA 

TGT'TTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGCTTAATAAATGA 

^TCAAACTACACTTCAATGTTAGCAAATAAAATTGATTTTAGAGTTTTTTCTAATTTTTTTGCTAGGGTTTCTTTA 

TATTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTA 

TCAATTGA ' • 

tl05 .nt 

T , GTAATGTTGTAGATACAGATTTTAGTGTTTTGGAGTTTAAGGTTGCAAATTTTAATTTAAATGATGAT_TTTTCTC 
AAGGGTTACTTGATTCTGCTTATAATATTCTAAATCGAAGTTTTGATTTAATAATTATTAAGAATCTTAAGAATAA 
AAATGTTCTTGATTTAATTAATAATAGAGTTTTATTTAGAGCTTTTAAGAATGCTTATTTTATTGATCAAGGTAGT 
GGCCTTTCTGTTAGCATtCTTTCTAAGCGCAAAATAAATATTAAAGTTTTAAGTGTAATGCAAGATTCTTGCGATT 
T'AAAATTAGGATTGCTTGTGGATTTTAAATTTGAGAATAATCACTATGGTATTGTTATTTATAATTTAAGCAAGGA 
TTTTATTAAAAGTATTGCCAATTTGCAAATTAGTGAACAAATTTTATATTTAAAAGCCCAAATGGATAAATTGATG 
T'TTATTTTAGATGAATCTGAATTTGTTATTTTTGATTTATTAATCAAAAATGGATTTTTTAGC.TTAATAAATGATT 
CAAACTACACTTCAATGTTAGCAAATAAAATTGATTTTAGAGTTTTT'TCTAATTTTTTTGCTAGGGTTTCTTTATA 
TTCATTTATGTTTGTAATTGCAGATTATTTGCATAGCAATTATGTTGTTGAGAATTTTCCTCAAAAAATAGTTATC 

AATTGA 
£150. aa 

MKTFVIIGLSNLGIHLLEDLSRLDCQIIIIDTSKELIEEYDVIStESFWEQFTKNALKRIIPVDTDAWIDFDDD 
LGKSALVTHYCNLLGLKEICVKTENRDDAEILKTLGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETV 
IPKEYVGKTLFEADLRRECGITVIAVRNLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSK 

EDENLNKDAEKKSRFLGIFNFMKIFQKDRKDNZ 

tlSO.aa 

CQI 1 1 IDTSKELIEEYDVISTESFWEQFTKNAXjKRI I PVDTDAWIDFDDDLGKSALVTKYCNLLGLKEICVKTE 
NRDDAEILKTLGATKIIFPSKDAARRLTPLLVSPNLSTYNIIGYDIIVAETVIPKEYVGKTLFEADLRRECGITVX 
AV11NLSNSRYEFVDGDYFFLKDDKIVICGKPDSIENFTNNKDLIKDLISGSKEDENLNKDAEKKSRFLGIFNFMKI 

FQKDRKDMZ . 
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ATGAAAACATTTGTTATTATTGGACTTAGTAATTTAGGCATTCACTTACTTGAAGATTTAAGCAGGCTTGATTGTC 
• . AAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTGTTGA 
GCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGATGAT 
CTTGGCAAAAGTGCTCTTGTTACTCACTATTGTAATCTTTTAGGTTTGAAAGAAATATGCGTTAAGACAGAAAATA 
GAGATGATGCTGAAATCTTAAAJ^CTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAAGATT 
" AACTCCATTATTAGTATGTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAACTGTT 
ATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATTGCTG 
TTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATGATAAAATTGTAAT 
TTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAATTAAAGATTTAATTTCAGGCTCTAAA 
GAGGATGAJ\AATTTAAATAAJ^GATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATTTTTC" 

AAAAAGATCGTAAGGATAATTAG . - 

' < "tlSO.nt . 

TGTCAAATTATTATTATAGATACATCTAAAGAGCTTATTGAAGAATATGATGTGATATCTACAGAAAGCTTTGTTG 
TTGAGCAATTCACTAAAAATGCTTTGAAAAGAATAATTCCAGTAGATACAGACGCTGTTGTTATTGATTTTGATGA 

(*} TGATCTTGGCi vj^^ 

^ AATAGAGATGATGCTGAAATCTTAAAAACTCTTGGGGCAACAAAAATTATATTTCCAAGTAAAGATGCTGCAAGAA 

tfi GATTAACTCCATTATTAGTATCTCCAAATCTTTCAACTTATAATATTATTGGGTATGATATTATTGTTGCTGAAAC 

UJ TGTTATTCCCAAAGAATATGTTGGTAAAACTCTTTTTGAAGCCGATCTTAGAAGAGAATGTGGGATTACAGTTATT 
Q * GCTGTTAGAAATTTAAGTAATTCTAGGTATGAATTTGTTGATGGCGATTATTTTTTTTTAAAAGATGATAAAATTG 

F1J TAATTTGTGGTAAACCAGATAGCATTGAAAATTTTACAAATAATAAAGATTTAATTAAAGATTTAATTTCAGGCTC 

UJ TAAAGAGGATGAAAATTTAAATAAAGATGCTGAGAAAAAATCTAGATTTTTAGGGATTTTCAATTTTATGAAAATT- 

□ TTTC AAAAAGATCGTAAGGATAATTAG " • 

□ f219.aa ' ■ . ' " ' _ . 

«1 MLIARIMNINTLFYGMIIIIFALISCNHKNIQYDKRIKKFLDKWKIEYKIDSENDFIAFKDINNNEKEEVIIRSRL 
M NSYKNSKIREIFGIVKVFDINTPKIKEISDSLMSDSYNNRVFGSWEIIHNAERGINSLVYIVKAEEFANDTFLLDA 
IDE IASTISIFKKIITTNNENIDNNEENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 

? "' ? t219 .aa' . 

CNHKNIQYDKRIKKFLDKNKIEYKIDSENDFIAFKDINNNEKEEVIIRSRLNSYKNSKIREIFGIVKVFDINTPKI 
KEISDSLMSDSYNNRVFGSWEIIHNAERGINSLVYIVKAEEFANDTFLLDAIDEIASTISIFKKIITTNNENIDNN 

EENNNTNESNEQPTLKQEKTNSTKESNNELKEDQIEEELQEIKAQZ 
f219 .nt ■ 

' ATGCTAATTGCAAGAATAATGAATATTAATACATTATTCTACGGCATGATCATTATCATTTTTGCACTCATTTCTT 
GCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAGA 
CTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATCATCAGATCAAGACTA 
AACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCAAAAATAA 
AAGAAATATCTGACTCGCTTATGAGCGATAGTTATAATAACAGAGTATTTGGATCGTGGGAGATTATTCATAATGC 
AGAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACATTTTTGCTTGATGCA 
<. ATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAATAATG 
AAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAATC 
' TAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

t219.nt 

TGCAATCATAAGAATATACAGTACGACAAGAGAATTAAAAAATTTTTAGATAAAAACAAAATTGAATATAAAATAG 
A.CTCAGAAAATGACTTTATAGCATTTAAAGATATAAACAATAACGAAAAAGAAGAAGTAATGATCAGATCAAGACT 
AAACTCATATAAAAATTCAAAGATAAGAGAAATATTTGGAATTGTTAAAGTATTTGATATAAACACACCAAAAATA 
AAAGAAATATCTGACTCGCTTATGAGCC-ATAGTTATAATAACAGAGTATTTGGATCGTGGGAGATTATTCATAATG 
C^GAAAGAGGAATCAACTCTTTGGTATATATTGTAAAAGCAGAAGAATTTGCAAATGATACATTTTTGCTTGATGC 
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AATTGATGAGATTGCCTCAACAATAAGTATTTTCAAAAAAATAATAACAACCAACAACGAAAACATTGATAATAAT 
GAAGAAAATAACAATACAAATGAATCAAATGAACAGCCCACCTTAAAGCAAGAAAAAACAAATTCAACAAAAGAAT 
CTAATAACGAACTTAAAGAAGATCAAATAGAAGAAGAACTTCAAGAAATCAAAGCCCAATAA 

f 229 .aa . , - 

MRVDLLPLVELSLYINLSFCCKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYIN 
LELLEEFTLEIIPGYVDFEKFKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKLNILINNIRKFPFDLNIDMT 

VNMPLQKKSHLKRDLQRIAFIYAZ 

t229.aa • , : 

CKDFSIFNRILEELKCHLILLGHPIIKTLYIKHVDFCLSRQDNLKFIFTSLSKYINLELLEEFTLEIIPGYVDFEK 
FKLLDEFCITRINLNVQSFSLEFRKIVGIPEISYKKLNILINNIRKFPFDLNIDMTVNMPLQKKSHLKRDLQRIAF 

IYAZ " ' 

f229.nt - ' 

ATGAGAGTAGATCTTTTACCTCTTGTCGAGTTAAGTCTTTATATTAATTTGTCATTTTGTTGTAAAGATTTTAGCA 
TTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTATAAAAACACTTTACAT 
TAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTTGTCCAAGTATATTAAT 
TTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAATTCAAACTTTTGGATG 
AATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGATTGTGGGGATACCCGA 
AATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTTGAATATTGACATGACT 
GTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTCATATATGCCTGA . 

t22 9 .nt " . . '. . • 

TGTAAAGATTTTAGCATTTTTAATAGAATTTTAGAGGAATTAAAATGTCATTTAATCTTGCTGGGTCATCCAATTA 
TAAAAACACTTTACATTAAGCACGTAGATTTTTGTTTATCTAGGCAAGATAATTTAAAATTTATTTTCACTTCTTT 
GTCCAAGTATATTAATTTGGAGTTATTAGAAGAATTTACTTTAGAAATTATTCCGGGTTATGTTGATTTTGAAAAA 
TTCAAACTTTTGGATGAATTTTGTATTACTAGAATTAATCTTAATGTTCAAAGTTTTTCTTTAGAGTTTAGAAAGA 
TTG'TGGGGATACCCGAAATTTCTTATAAAAAATTGAATATTTTGATTAACAATATTAGAAAGTTTCCTTTTGATTT 
GAATATTGACATGACTGTCAATATGCCTTTGCAAAAAAAATCTCATCTCAAGCGAGATTTGCAAAGAATTGCTTTC 

ATATATGCCTGA • 
■f 22 .aa • 

MLKTLTKIITISCLIVGCASLPYTPPKQNLNYLMELLPGANLYAHVNLIKNRSIYNSLSPKYKSVLGLISNLYFSY 
KKENNDFALLIMGNFPKDIFWGIHKNRNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIAITQKDITAKDNNMLTT 
KYIGEIEKNEMFFWIQDPTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVL 

TNMTNLTISSHIKTTIKDQNTVEIEFNIQKSSVESLIEKLASNIQT • 
t22.aa 

CAS LPYT PPKQNLNYLMELLPGANLYAHVNL IKNRS I YNS LS PKYKSVLGLI SNLYFSYKKENNDF ALL IMGNF PK 
DIFWGIHKNRNTESIGNIFTNPKWKLKNSNIYIIPNKARTSIAITQKDITAKDNNMLTTKYIGEIEKNEMFFWIQD 
PTLLLPNQIVSSKNLIPFSSGTLSINSLNQEEYIFKSLIKTNNPPILKILSKKLIPTVLTNMTNLTISSHIKTTIK 

DQNTVEIEFNIQKSSVESLIEKLASNIQT 

f22.nt • ' 

ATGTTAAAAAC^TTAACAAAAATAATTACCATTTCATGCCTCATAGTGGGATGCGCAAGCCTGCCTTACACTCCTC 
CAAAACAAAATCTAAATTACTTAATG<3AACTTTTACCTGGCGCAAATTTATACGCCCATGTAAATTTAATTAAAAA 
CAGGTCTATTTATAACTCTT-TAAGCCCTAAATATAAATCAGTTCTTGGGCTTATAAGCAATTTATACTTTAGCTAT 
AAAAAAGAAAATAACGATTTTGCTCTACTAATAATG^TAATTTCCCAAAAGATATTTTCTGGGGAATTCATAAAA 
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ATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAACTTAAAAATTCAAATATATACATTAT 
TCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGCAAAAGACAATAATATGCTAACAACA 
AAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGATCCAACATTATTGCTCCCAAACCAAA 
TAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAAACAGCTTAAATCAAGAAGAATATAT 
TTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTCAAAAAAGTTAATTCCAACCGTCTTG 
ACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAAGACCAAAATACGGTTGAAATAGAAT 
TT AAT ATTC AAAAATC T AGTGTTG AAAGC C TT AT AG AAAAAC T AGC TTC AAAT ATTC AAAC C TAA 

t22.nt 

TGCGCAAGCCTGCCTTACACTCCTCCAAAACAAAATCTAAATTACTTAATGGAACTTTTACCTGGCGCAAATTTAT 
ACGCCCATGTAAATTTAATTAAAAACAGGTCTATTTATAACTCTTTAAGCCCTAAATATAAATCAGTTCTTGGGCT 
TATAAGCAATTTATACTTTAGCTATAAAAAAGAAAATAACGATTTTGCTCTACTAATAATGGGTAATTTCCCAAAA 
GATATTTTCTGGGGAATTCATAAAAATAGAAATACAGAATCAATAGGCAATATATTTACAAATCCAAAATGGAAAC 
TTAAAAATTCAAATATATACATTATTCCAAACAAAGCTAGAACTAGCATTGCAATAACCCAAAAAGATATAACCGC 
AAAAGACAATAATATGCTAACAACAAAATATATTGGGGAAATAGAAAAAAATGAAATGTTTTTTTGGATTCAAGAT 
CCAACATTATTGCTCCCAAACCAAATAGTAAGCAGCAAAAATTTAATTCCCTTTAGCAGTGGAACTTTGTCTATAA 
ACAGCTTAAATCAAGAAGAATATATTTTTAAATCCTTAATCAAAACAAATAATCCACCAATACTAAAAATATTGTC 
AAAAAAGTTAATTCCAACCGTCTTGACAAACATGACAAACCTCACAATATCAAGCCACATAAAGACCACAATAAAA 
GACCAAAATACGGTTG AAAT AGAATTT AAT ATTC AAAAATCTAGTGTTGAAAGCCTTATAGAAAAACTAGC TTC AA 

ATATTCAAACCTAA 
f32 .aa 

MNTKTLYLISLILLACNKNNklPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEW 
KIEKTLEKTERYGIEGNWILVNYKGTKRYIFSKDINIVNNLIIDHSK 

t32.aa ' ' 

CNKNNKIPLIQKLDLPKSSILGFSNKMGIIIKDYAFLSKSTKKNSELDYDYAILLRKDEWKIEKTLEKTERYGIE 
GNWILVNYKGTKRYIFSKDINIVNNLIIDHSK - 

f32 .nt 

ATGAATACAAAAACATTATATTTAATATCCTTAATTCTTTTAGCTTGCAATAAAAATAACAAAATTCCTCTCATTC 
AAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATAAAATGGGCATAATAATAAAAGATTATGCTTT 
TCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTACGCAATTCTACTCAGAAAAGACGAAGTCGTA 
AAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAAGGAAATTGGATCCTAGTCAATTACAAGGGAA 
CTAAAAGATACATCTTTAGCAAAGACATCAATATA'GTCAACAATTTAATAATTGATCATTCTAAATAG 

t32.nt ' • 

TGCAATAAAAATAACAAAATTCCTCTCATTCAAAAATTAGATTTGCCCAAAAGCAGCATTCTTGGCTTTAGCAATA 
AAATGGGCATAATAATAAAAGATTATGCTTTTCTTAGTAAAAGCACTAAGAAAAATAGCGAATTGGATTATGATTA 
CGCAATTCTACTCAGAAAAGACGAAGTCGTAAAAATTGAAAAAACACTAGAAAAAACAGAGCGCTATGGAATTGAA 
GGAAATTGGATCCTAGTCAATTACAAGGGAACTAAAAGATACATCTTTAGCAAAGACATCAATATAGTCAACAATT 

TAATAATTGATCATTCTAAATAG • 

fl86.aa • 

MKKLIIIFTLFLSQACNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITKTL 
FGTTPMQRI-HKYDQSFNLTREIIASGIELNRVVNAWLNSPSKKEALINTDTDKIGGYRLKTTDNIDIFVVLFGKRK 

YKN , . 

tl86.aa 
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CNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTLFGTT 
' pMQRIHKYDQSFNLTREILASGIELNRVVNAWLNSPSHKFJVLINTDTDKIGGYRLKTTDNIDIFVVLFGKRKYKN 

fl86.nt 

ATGAAAAAATTGATTATAATTTTTACACTGTTTTTATCTCAAGCATGCAATTTAAGTACAATGCATAAAATAGATA 
C AAAAGAAG ATATGAAAATTC T AT ATTC AG AAATTGC TGAATT GAG AAAAAAATTAAATC TAAAC CATC TAG AAAT 
AGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTAAACTGGGAGAAAATAGAACAATAACTCACACCCTT 
TTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCCTTTAATTTAACAAGAGAAATACTGGCATCAG 
GAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAAGCCACAAAGAAGCTCTTATTAATACAGATAC 
CGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGATATATTTGTAGTTCTTTTTGGAAAAAGAAAA 

TATAAGAATTGA 

t!86 .nt 1 ■ ' 

TGCAATTTAAGT^CAATGCATAAAATAGATACAAAAGAAGATATGAAAATTCTATATTCAGAAATTGCTGAATTGA 
GAAAAAAATTAAATCTAAACCATCTAGAAATAGATGATACCCTTGAAAAAGTTGCAAAAGAATATGCCATTA^CT 
GGGAGAAAAT^GAACAATAACTCACACCCTTTTTGGCACAACCCCAATGCAAAGAATACATAAATACGATCAATCC 
TTTAATTTAACAAGAGAAATACTGGCATCAGGAATTGAACTTAACAGAGTAGTTAATGCATGGCTTAATAGTCCAA 
GCCACAAAGAAGCTCTTATTAATACAGATACCGATAAAATAGGTGGCTATAGATTAAAAACGACTGACAATATAGA 
TATATTTGTAGTTCTTTTTGGAAAAAGAAAATATAAGAATTGA 

- f216 .aa * ' - ' 

MIRVLLGSLAVSFLFSICMVFLNYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFG 
FLLSDSRFLYSFLKNGVYYVNLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVILVEGCI 

LKEQS 
t216 . aa 

CMVFLNYDNLFSKKVFYFHSSKGFVANLRYLRDEQNLKDNLDLLVKDFLLGSNEGFSFGFLLSDSRFLYSFLKNGV 
YYVNLSREFYDSFNNGDYNESNESFDVKVNLFAMSLIKTMRFNYPGKIKKIVILVEGCILKEQS 

f216 .nt _ 

ATGATTAGGGTGCTTTTGGGGTCTTTGGCAGTAAGCTTTTTGTTTTCTATTTGTATGGTTTTTTTAAATTATGATA 
ATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTGCTAATTTAAGATATTTAAGAGATGA 
ACAAAATTTGAAAGAT^TTTAGATCTTTTAGTAAAAGATTTTCTTTTAGGAAGCAATGAAGGGTTTTCTTTTGGG 
TTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGTTTATTATGTAAATCTTTCAAGAGAAT 
TT^ATGATTCTTT^AATAATGGTGATTATAATGAATCTAATGAATCTTTTGATGTTAAGGTCAATCTTTTTGCTAT 
GTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAAGATTGTTATTCTTGTTGAAGGGTGTATC 

TTAAAGGAGCAAAGTTGA - , • ^ - 

t216.nt * 

TGTATGGTTTTTTTAAATTATGATAATCTTTTTTCAAAAAAGGTTTTTTATTTTCATTCTAGCAAGGGATTTGTTG 
CTAATTTAAGATATTTAAGAGATGAACAAAATTTGAAAGATAATTTAGATCTTTTAGTAAAAGATTTTCTTTTAGG 
AAGCAATGAAGGGTTTTCTTTTGGGTTTTTATTAAGTGATTCAAGATTTTTATATTCTTTTTTAAAGAATGGAGTT 
TATTATGTAAATCTTTCAAGAGAATTTTATGATTCTTTTAATAATGGTGATTATAATGAATCTAATGAATCTTTTG 
' ATGTTAAGGTCAATCTTTTTGCTATGTCTTTAATAAAAACAATGCGCTTTAACTATCCTGGTAAGATAAAAAAGAT 
TGTTATTCTTGTTGAAGGGTGTATCTTAAAGGAGCAAAGTTGA 

f328.aa 

MAIKYARENNIPFLGICLGLQL^ 

VILKKNTIAFKLYGQDRIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEIPENKFFVACQFHPELITR 
IENPAKLFLGLIKACI* 
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t328.aa 

CLGLQLAVIEFARIWCGILDADTEENLARDKPLKSPVIHLLPEQKGIKDKGATMRLGGYPVILKKNTIAFKLYGQD 
RIIERFRHRYEVNNDYIDLFAKNGLIVSGFSSDFKMAKLIEIPENKFFVACQFHPELITRIENPAKLF 

LGLIKACI 
f328.nt 

ATGGCTATTAAATATGCTCGTGAGAATAATATTCCCTTTCTTGGAATTTGTCTTGGTTTGCAGCTTGCTGTAATAG 
AATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAAATTTAGCAAGAGACAAGCCCTTAAAAAG 
TCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGGCGCTACAATGAGGCTTGGTGGATATCCT 
GTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGATCGGATAATTGAAAGATTTAGACATAGGT 
ATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTATAGTATCTGGATTTTCAAGTGATTTTAA 
AATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTGCCAGTTTCATCCAGAACTTATTACAAGA 
ATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATTTGA 

t328.nt ' . 

TGTCTTGGTTTGCAGCTTGCTGTAATAGAATTTGCTCGTAATGTTTGTGGAATACTTGATGCTGATACGGAGGAAA 

ATTTAGCAAGAGACAAGCCCTTAAAAAGTCCTGTTATCCATTTACTTCCTGAGCAAAAGGGAATTAAAGATAAGGG 

CGCTACAATGAGGCTTGGTGGATATCCTGTGATTCTTAAAAAGAATACAATAGCTTTTAAACTTTATGGCCAAGAT 

CGGATAATTGAAAGATTTAGACATAGGTATGAAGTCAATAATGATTATATAGATTTATTTGCAAAAAATGGGCTTA 

TAGTATCTGGATTTTCAAGTGATTTTAAAATGGCAAAATTAATAGAAATTCCTGAAAATAAATTTTTCGTAGCTTG ■ 

CCAGTTTCATCCAGAACTTATTACAAGAATAGAAAATCCAGCCAAGCTTTTTCTAGGATTAATTAAAGCTTGTATT 

TGA . ' 

■ f352 .aa 

MNKTKNRSLTYFIILSCISLFGANNNTISYSSIEIPLEDLSEEFKSSGNKSDQINTSKHLNKNIVSYEDPKKGKDL 
KLPENIRDKKLPQKRMDENDLKSVIENYENKIKNIEKLLKTKNQKTSENENKKIESIEKKAKKYEILTNKL^EIV 
EIKKLLNKKIKPKEDENYEKINIENIEEETDDDFEDNYEYNDEIEXTNEDNYPSNEGIINNLKENLNENEKYYAIN, 
EKKIDELEDRINENENTILDLQRELRNFKKKDNSDKNLEEIEENLSSIGRIINDLKRKISANEAINKENQKKIRTD 

KHKLKELEDKIKENEETILKLQKELNNFKKKEIYQKPLNEE^^ 

EENTKSTPKTTMIKTADFQIYPDIYLNNYKFKEKGDQFAFKKENTYYIEIDPTNNLNEALKNHEIISKYKFEKYFI 
NPILKNKEEFFRNLIEVKNIHELGIMYKNLKPEFKQIKIIK 

t352 .aa • ' 

CISLFGANNNTISYSSrEIPLEDLSEEFKSSGNKSDQINTSKHL 

DENDLKSVI ENYENKIKNI EKLLKTKNQKTSENENKKIES IEKKAKKYE I LTNKLKNE I VE I KKLLN 
NYEKINIENIEEETDDDFEDNYEYNDEIEXTNEDNYPSNEGIINNLKENLNENEKYYAINEKKIDELEDRINENEN 

TILDLQRELRNFKKKDNSDKNLEEIEENLSSIGRIINDLKRKIS^^ 
TILKLQKELNNFKKKEIYQKPLNEETFTPSITSKNDDLEENKKL 

DFQIYPDIYLNNYKFKEKGDQFAFKKENTYYIEIDPTNNLNEALKNHEIISKYKFEKYFINPILKNKEcFFRNLIE 
VKNIHELGIMYKNLKPEFKQIKIIK - 

f352.nt 

ATGAATAAAACAAAAAATCC-AAGCCTTACGTATTTTATAATACTTTCATGTATATCATTATTTGGGGCTAATAATA 
ATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTGAAGAATTTAAAAGTTCTGGGAATAAAAG 
CGATCAAATAAATACCTCAAAACATTTAAACAAAAACATAGTTTCTTATGAAGACCCAAAAAAGGGTAAAGATCTA 
AAATTGCCAGAAAATATAAGAGACAAAAAACTACCCCAAAJ^GAATGGACGAAAATGATCTAAAATCTGTAATTG 
AAAATTATGAAAATAAAATTAAAAACATAGAAAAGCTTTTAAAAACCAAAAATCAAAAAACATCGGAAAATGAAAA 

^AAJ^AAAATAGAJ^TCAATCGAAAAAAAAGCAAAAAAATATGAAATT 

GA \ATAAAAAAGCTCCTTAACAAAAAAJ^TCAAGCCTAAAGAAGA.TGAAAATTACGAAAAAATAAATATTGAAAACA 
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TTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATGATGAAATTGAAGAACAAATGAGGACAAT 
TACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTTAATGAAAACGAAAAATATTATGCTATTAATG 
AAAAAAAAATCGATGAACTTGAAGACAGAATCAACGAGAATGAAAACACTATTTTAGACTTGCAAAGAGAATTAAG 
GAATTTT AAAAAAAAAGAT AAC TCAGAT AAAAAC TTAGAAGAAATTGAGGAAAATTT^ 
ATTAATGATC.TAAAAAGAAAAATCAGCGCAAATGAAGCAATA 

AAC AC AAAC TC AAAG AATTAGAAG ATAAAATAAAGGAAAATGAAG AGAC T ATTTT AAAAC TTC AAAAAGAATT AAA 
CAATTTTAAAAAAAAAGAAATTTATCAAAAACCCTTAAATGAAGAAACTTTCACTCCAAGCATTACAAGTAAAAAT 
GACGAC TT AGAAGAAAAT AAGAAATTAAAAAAGGAATATTT AAAGC C C AT AGAAAAAAAAG AAAGC CGAGATC T AG 
AAGAAAATACTAAAAGCACCCCAAAAACAACTATGATAAAAACAGCAGATTTTCAAATCTACCCTGACATATATCT 
TAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAAAGAAAACACATACTATATTGAAATAGAT 
CCCACTAACAATTTAAATGAGGCTTTAAAAAATCATGAAATAATCTCAAAATATAAATTTGAAAAATATTTCATTA 
ACCCTATTCTAAAAAATAAAGAAGAATTTTTTAGAAACTTAATAGAAGTCAAAAATATCCACGAACTAGGAATTAT 
GTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATAA 

t352.nt 

TGTATATCATTATTTGGGGCTAATAATAATACAATAAGCTACTCTAGCATTGAAATTCCTCTAGAAGACTTAAGTG 
AAGAATTTAAAAGTTCTGGGAATAAAAGCGATCAAATAAAT AC CTC AAAAC ATTTAAACA^ 

TGAAGACC CAAAAAAGGGT AAAG ATC TAAAATTGC CAGAAAAT AT AAG AGAC AAAAAAC T AC C C C AAAAAAGAATG ^ 

GACGAAAATGATCTAAAATCTGTAATTGAAAATTATGAAAATAAAATTAAAAACATA-GAAAAGC 

AAAATCAAAAAACATCGGAAAATGAAAATAAAAAAATAGAATCAATCGAAAAA^ 

AACCAATAAATTAAAAAACGAAATAGTAGAAATAAAAAAGCTCCTTAACAAAAAAATCAAGCCTAAAGAAG 
AATTACGAAAAAATAAATATTGAAAACATTGAAGAAGAAACTGATGATGATTTTGAAGACAATTATGAATATAATG 
ATGAAATTGAAGAACAAATGAGGACAATTACCCTTCTAATGAAGGAATAATAAACAATCTAAAAGAAAATCTT 
GAAAACGAAAAATATTATGCTATTAATGAAAAAAAAATCGATGAACTT.GAAGACAGAATCAACGAGAATGAAAACA 
C T ATTTT AGAC TTGC AAAGAGAATT AAGGAATTTT AAAAAAAAAGAT AAC TC AGAT AAAAAC TTAGAAGAAATTGA 
GGAAAATTTATCTTCAATAGGAAGAATAATTAATGATCTAAAAAGAAAAATCAGCGCAAATGAAGCAA^ 
GAAAATC AAAAAAAAAT AAGAAC TGAT AAAC AC AAAC TC AAAG AATTAGAAG ATAAAATAAAGGAAAATGAAG AG A 
CTATTTTAAAACTTCAAAAAGAATT AAAC AATTTTAAAAAAAAAGAAATTT ATC AAAAAC CCTTAAATGAAGAAAC 
TTTC AC TCCAAGC ATT AC AAGT AAAAATG AC G ACTT AGAAGAAAAT AAG AAATTAAAAAAGGAATATTTAAAGCCC 

AT AG AAAAAAAAGAAAGCCGAGATC T AGAAGAAAAT AC TAAAAGC AC CC C AAAAAC AAC T ATG AT AAAAAC AGC AG 
ATTTTCAAATCTACCCTGACATATATCTTAATAATTATAAATTTAAAGAAAAGGGAGATCAATTTGCATTTAAAAA 

AGAAAAC AC AT AC T AT ATTG AAAT AGATC C C AC T AAC AATTT AAATGAGGC TTTAAAAAATC ATGAAAT AATC TC A 

AAATATAAATTTGAAAAATATTTCATTAACCCTATTCTAAAAAATAAAGAAGAATTT-TTTAGAAACTTAA^ 

TCAAAAATATCCACGAACTAGGAATTATGTATAAAAATCTAAAGCCTGAATTTAAGCAAATAAAAATAATTAAATA 



£867 .aa 

MNTKGKVVGVNGNLVTIEVEGSVSMNEVLFVK^ 

ELGPGLLTQWDGLQNPLPELAIQCGFFLERGVYLRPLNKDKKWNFKKTSKVGDIVIAGDFLGFVIEGTVHHQIMI 
PFYKRDSYKI^/EIVSDGDYSIDEQIAVIEDDSGMRHNITMSFHWPVKVPITNYKSRLIPSEPMLTQTRIIDTFFPV 
AKGGTFCIPGPFGAGKTVLQQVTSRNADVDVVIIAACGERAGEVyETLKEFPELMDPKTGKSLMDRTCIICNTSSM 
PVAAREASVYTAITIGEYYRQMGLDILLLADSTSRWAQAMREMSGRLEEIPGEEAFPAYLESVIASFYERAGIVVL 
NNGDIGSVTVGGSVSPAGGNFEEPVTQATLKWGAFHGLTRERSDARKFPAISPLESWSKYKGVIDQKKTEYARSF 
LVKGNEINQMMKVVGEEGISNDDFLIYLKSELLDSCYLQQNSFDSIDAAVSSERQNYMFDIVYNILKTNFEFSDKL 

QARDFINELRQNLLDMNLSSFKDHKFNKLEHALGELINFKKVI 

t867.aa ' 

GRNLKAEVIRIRGNEVDAQVFELTKGISVGDLVEFTDKLLTVELGPGLLTQVTDGLQNPLPELAIQCGFFLERGVY 
LRPLNKDKKWNFKKTSKVGDIVIAGDFLGFVIEGTVHHQIiMIPFYKRDSYKIVEIVSIXSDYSIDtQIAVIEDDSGM 

RHNITMSFHWPVKVPITNYKERLIPSEPMLTQTO^ 

AACGERAGEWETLKEFPELMDPKTGKSLMDRTCIICNTSSMPVAAREASVYTAITIGEYYRQMGLDILLLADSTS 

RWAQ AMREMSGRL E E I PG EEAF PAY L E S V I AS F Y ERAG I WLNNGD I G S VTVGG S V S P AGGNF E E PVTQ AT LKWG 
AFHGLTRERSDARKFPAISPLESWSKYXGVIDQKKTEYARSFLVKGNEINQMMKWGEEGISNDDFLIYLKSELLD 
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SCYLQQNSFDSIDAAVSSERQNYMFDIVYNILKTNFEFSDKLQARDFINELRQNLLDMNLSSFKDHKFNKLEHALG 
ELINFKKVI 

f867.nt 

ATGAATACAAAAGGAAAAGTCGTTGGAGTTAATGGAAACTTAGTTACTATTGAGGTAGAAGGTTCAGTTTCTATGA 
ATGAAGTTTTATTTGTAAAGACTGCTGGTAGGAATTTAAAAGCAGAAGTAATTCGTATTAGGGGCAATGAAGTTGA 
TGCACAGGTTTTTGAATTGACAAAAGGGATATCTGTTGGAGACCTAGTTGAATTTACAGACAAACTTTTAACAGTT 
GAACTCGGACCAGGGCTTTTAACTCAAGTATATGATGGGCTTCAAAATCCTTTGCCTGAATTGGCTATTCAATGTG 
GATTTTTTTTAGAAAGGGGAGTATATTTAAGGCCCTTGAATAAAGATAAAAAGTGGAATTTTAAAAAAACCTCCAA 
AGTTGGAGATATCGTTATTGCAGGAGATTTTTTAGGTTTTGTAATTGAGGGAACTGTTCACCATCAAATAATGATT 
'CCATTTTATAAAAGGGATTCTTATAAAATTGTGGAGATTGTAAGTGATGGCGACTATTCGATTGATGAGCAAATTG 
CTGTAATTGAAGATGATTCTGGTATGAGGCATAATATTACAATGTCTTTTCATTGGCCTGTTAAAGTTCCTATTAC 

taSSSag^^ 

gc^aISggtggaactttttgcattccgggtccttttggagcaggaaaaacggttcttcagcaggttacaagtcgaa 
atcc^a^gItgtagtgattattgcagcttgtggtgagcgagcaggagaagtggt^^ 

icCCGAATTAATGGATCCAAAAACCGGC^TCTTTAATGGACAGGACTTG 

ccagttgcagctagagaagcttctgtttatactgctattactattggtgagtattacaggcaaatgggccttgata 

TTCTTCTTTTGGCAGATTCAACTTCAAGATGGGCTCAAGCAATGAGAGAAATGTCTGGACGCCTTGAGGAAATTCC 

Sgcga^aggcttttccggcatatcttgagtctgttattgcttccttttatgaaagggcaggtattgtagttctt 
StaatcgSSattggatctgtaacagttggtggctctgt^^ 

c^caagcaactttaaaagttgtaggagcatttcacgggcttacaagagaaaggtctgatgctaggaaatttccagc 
tat^ctcctcttgaatcttggag 

Sggtgaaaggtaatgaaattaatcaaatgatgaaagttgttggagaagaaggcataag^ 

ScagIgc^SLttatatgtttgatatagtttataacattcttaaa^^ 

ttaataaattggagcatgctttgggtgaattgataaattttaaaaaggtaatttag 

t867.nt 

ggtaggaatttaaaagcagaagtaattcgtattaggggcaatgaagttgatgcacaggtttttgaattgacaaaag 
SSgttggagacctagttgaatttacagacaaacttttaacagttgaactcggaccagggcttttaactcaact 

gSSSS^gata^ 

StSagattgtaagtgatggcgactattcgattgatgagcaaattgctgtaat^ 
ata^tattacaatgtcttttcattggcctgttaaagttcctattactaattataaggaacgccttattcc 

acctaJg^gactcaaactagaattatagatacatttttcccagttgccaaagg^^ 

ccttttgga^caggaaaaacggttcttcagcaggttacaagtcgaaatgctgatgttgatgta 

cttgtggtgagcgagcaggagaagtggtagaaactcttaaagaatttcccgaattaatggat^ 

ATCTTTAATGGACAGGACTTGTATTATTTGTAATACA-TCTTCAATGCCAGTTGCAGCTAGAGAAGCTT 
ACTGCTATTACTATTGGTGAGTATTACAGGCAAATGGGCCTTGATATTCTTCTTTTGGCAGATTCAACTTCAAGAT 

gtctgttattgcttccttttatgaaagggcaggtattgtagttcttaataatggggatattggatctgtaacagtt 

ggtggctctgtaagtcctgctggtggtaattttg^ 
ttcacgggcttacaagagaaac^gtctgatgctaggaaatt^^ 

taaaggcg^tattgatcaaaaaaagactgaatatgcaagatcttttttggtgaaaggtaatgaaattaatcaaat? 

atg^aag^tgttggagaagaaggcataagtaacgatgattttttaatttatttaaaatccgagctact 

gc?a^tgc?gcaaaattcatttgattctattgatgctgctgttagttcagagcgtcaaaattata 

agtttataacattc^taaaactaactttgagtttt 

CAAA^TCTTTTAGACATGAATCTTTCTTCTTTTAA-GGATCATAAGTTTAATAAATTGGAGCATGCTTTGGGTGAAT 

TJGATAAATTTTAAAAAGGTAATTTAG 
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f868.aa 

MKRVYSKIESIAGNVITVTAQGIKYGELAIVKAKDTSSLAEVIKLDREKVSLQVYGGTRGVSTSDEIKFLGHSMQV 
SFSDNLLGRIFDGSGNPRDGGPSLDDNLIEIGGPSANPTKRIVPRNMIRTGLPMIDVFNTLVESQKLPIFSVSGEP 
YNELLIRIALQAEVDLIILGGMGLKHDDYLTFKDSLEKGGALSRAIFFVHTANDSWESLTVPDISLSVAEKFALK 
GKKVLVLLTDMTNFADAMKEISITMEQVPSNRGYPGDLYSQLAYRYEKAIDFEGAGSITILAVTTMPGDDVTHPVP 
DNTGYITEGQYYLKGGRIEPFGSLSRBKQMVNSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKLLKY 
SNMFESKMMDLSVNIPLEEALDLGWSILASCFSPKETGIKTDLIEKYWPKKETY 

t868.aa 

QGIKYGELAIVKAKDTSSLAEVIKLDREKVSLQVYGGTRGVSTSDEIKFLGHSMQVSFSDNLLGRIFDGSGNPRDG 
GPSLDDNLIEIGGPSANPTKRIVPRNMIRTGLPMIDVFNTLVESQKLPIFSVSGEPYNELLIRIALQAEVDLIILG 
GMGLKHDDYLTFKDSLEKGGALSRAIFFVHTANDSWESLTVPDISLSVAEKFALKGKKVLVLLTDMTNFADAMKE 
ISITMEQVPSNRGYPGDLYSQLAYRYEKAIDFEGAGSITILAVTTMPGDDVTHPVPDNTGYITEGQYYLKGGRIEP 
FGSLSRLKQMWSRTRDDHRTIMDSMIKLYASSKESVEKKAMGFNMTKWDEKLLKYSNMFESKMMDLSVNIPLEEA 

' LDLGWS I LASCF S PKETG I KTDL IEKYW PKKET Y 

f868.nt - 

ATGAAAAGAGTCTATAGTAAAATAGAGTCTATAGCAGGCAATGTAATAACTGTTACAGCTCAAGGTATTAAGTATG 
GTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTCTAGCCGAAGTAATTAAACTTGATGGAGAAAAAGTTTC 
■ TCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGGGCATTCAATGCAGGTT 
TCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGGGAATCCTAGAGATGGGGGCCCTTCTCTTGATG 
ATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATCCTACAAAACGCATTGTTCCTAGAAATATGATAAGGACAGG 
GCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAAAAATTGCCAATTTTTTCTGTTTCTGGTGAGCCT 
TATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGCGGAATGGGACTTAAGC 
ATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAATTTTTTTTGTTCATAC 
TGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGATATTTCACTTTCTGTTGCTGAAAAGTTTGCTCTAAAG 
GGCAAAAAAGTTTTGGTGCTTCTCACAGACATGACAAATTTTGCTGATGCAATGAAAGAAATATCTATTACAATGG 
AACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCGTTATGAGAAGGCTATTGA 
CTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATGTTACTCATCCTGTTCCT 
GACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAATAGAGCCTTTTGGGTCTCTTTCAA 
GACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCACAGGACTATAATGGATTCAATGATCAAGCTTTATGC 
ATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGATTTAATATGACTAAGTGGGATGAAAAATTGCTCAAGTAT 
AGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTAATATTCCTTTAGAAGAGGCTTTAGATTTAGGTTGGA 
GCATTCTTGCTAGTTGTT.TTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAAAATATTGGCCTAAAAA 

AGAGACTTATTGA . 

t8 68 -nt . , . . 

CAAGGTATTAAGTATGGTGAGCTTGCTATTGTAAAAGCAAAAGATACAAGTTCTCTAGCCGAAGTAATTAAACTTG 
ATCGAGAAAAAGTTTCTCTTCAGGTTTATGGTGGTACAAGAGGTGTTTCCACGTCAGACGAGATAAAGTTTTTAGG 
GCATTCAATGCAGGTTTCATTTTCTGACAATTTGTTGGGCAGAATTTTTGATGGTTCTGGGAATCCTAGAGATGGG 
GGCCCTTCTCTTGATGATAATTTGATTGAAATTGGTGGGCCTTCTGCAAATCCTACAAAACGCATTGTTCCTAGAA 
" ATATGATAAC-GACAGGGCTTCCAATGATAGATGTTTTTAATACTCTTGTTGAATCTCAAAAATTGCCAATTTTTTC 
TGTTTCTGGTGAGCCTTATAATGAGCTTCTTATAAGAATTGCACTTCAAGCAGAAGTTGATTTAATAATTCTTGGC 
GGAATGGGACTTAAGCATGATGATTATTTAACTTTTAAAGATTCTTTAGAAAAGGGAGGTGCTTTAAGTAGAGCAA 
TTTTTTTTGTTCATACTGCTAATGATTCTGTTGTTGAATCTTTAACTGTTCCTGATATTTCACTTTCTGTTGCTGA 
AAAGTT^GCTCTAAAGGGCAAAAAAGTTTTGGTGCTTCTCACAGAeATGACAAATTTTGCTGATGCAATGAAAGAA 
ATATCTATTACAATGGAACAAGTGCCTTCTAATAGAGGTTATCCCGGGGATTTGTATTCTCAGCTTGCATATCGTT 
ATGAGAAGGCTATTGACTTTGAAGGCGCAGGATCAATTACAATACTTGCAGTTACAACAATGCCGGGTGACGATGT 
TACTCATrCTGTTCCTGACAATACTGGATACATTACAGAAGGTCAATACTATTTAAAAGGTGGCAGAATAGAGCCT 
TTTGGGTCTCTTTCAAGACTTAAGCAAATGGTAAATAGTAGAACTAGAGACGATCACAGGACTATAATGGATTCAA 
TGATCAAGCTTTATGCATCTTCAAAAGAGTCTGTAGAAAAAAAGGCTATGGGATTTAATATGACTAAGTGGGATGA 
AAAATTGCTCAAGTATAGCAATATGTTTGAAAGTAAGATGATGGATTTGTCTGTTAATATTCCTTTAGAAGAGGCT 
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TTAGATTTAGGTTGGAGCATTCTTGCTAGTTGTTTTAGCCCAAAAGAAACGGGAATAAAAACAGATCTTATTGAAA 
AATATTGGCCTAAAAAAGAGACTTATTGA 

f872,aa . 

MRSAVLFFFALPFSISLYSSSNKNFPYWILLEKGRQFLYSKSEFSKSNLTHAINYLQEALLRKGVYPEASYYLSVA 

ySsgnaiLklnlyksfedryyll^^ 

ENSMSNNELNASFYLTSYLKQVRGAFGIDFTFNLYRFKlSrYNVIDTHQLLSKVYLHLKAYELS.ITHGLIA 

Sdw^epwqfknlrsfvqkinkykaiknafestdfweivywaaatyaysngnykfraidtw^ 
spyiaksrsqxknsvylkkn 

t872 . aa • ■ 

snknfpywillekgrqflysksefsksnlthainylqeallrkgvypeasyylsvaygmsgnaileklnlyksfed 
ryylldesfekkilfsi^kmaelennyvdtidylndilnkfstkkdyysyhdysqgensmsnnelnasfyltsylk 
5^ga°gidftfnlyrfknynvidthqllskvylhlkayelsithgliaavgiltrmydwcy 
vqkinkykaiknafestdfweivynvaaatyaysngnykfraidtwklvvdlaprfspyiaksrsqikisisvylkkn 

f872.nt , 

ATGAGAAGTGCGGTTTTATTTTTTTTTGCTTTGCCTTTTTCTATTTCTTTGTATTCTTCAAGTAATAAAAATTTTC 
CGTATTCGATTTTACTTGAAAAAGGCAGGCAATTTCTTTATTCTAAATCTGAATTTAGTAAGTCTAATCT^ 
TCCTATTAATTATTTGCAGGAAGCTTTGCTTAGAAAAGGCGTTTATCCTGAGGCTAGTTATTATTTGTCAGTAGCT 
TATGGTATGTCTGGCAATGCTATTCTTGAAAAATTAAACCTTTATAAGTCTTTTGAAGACAGATATTATTTGCTAG 

Sgaatcttttgaaaaaaaaatacttt^ 

tca^atttgaatgacatattaaataagttttcaactaaaaaagattattatagttatc^^ 

gaaaacagtatgtcaaataatgaacttaatgcttcattttatttaacttcttatttaaaacaagtaagaggagctt 

ttg^attgattttacttttaatctttacagatttaaaaactacaatgttattgatactcatcaattattgtcaaa 

agtttatttgcacttaaaagcttatgagctttcaattactcatggacttatagctgcagta 
atgtatgattatgtttgttattatgaacctgtgtatcagtttaaaaatttaaggtcttttgttcaaaaaat^ 

AGTATAAGGCAATAAAAAATGCTTTTGAATCTACAGATTTTTGGGAAATAGTTTATAATG 

tgcaJattctaatggc^ 
' TCTCCTTATATTGCTAAATC aagaagtcaaattaaaaattctgtatatttaaaaaaaaattaa 

t872.nt - ~ " 

agtaataaaaattttccgtattggattttacttgaaaaaggcaggcaatttctttattctaaatctgaatttagta 
agtctaatcttagacatgctattaattatttgcaggaagctttgcttagaaaaggcgtttatcctgaggctagtta 

ttatttgtcagtagcttatcsgtatgtctggcaatgctatt^ 

agatattatttgctagatgaatcttttgaaa^ 

a^atgt^ 

tgattattctca^ggcgaaaacagtatgtcaaataatgaacttaatgcttcattttat^ 

caagtaagaggagctttt^tattgat™^ 
atcaattattgtcaaaagtttatttgcactt^ 

aggaattttaacaagaatgtatgattatgtttgttattatgaacctgtgtatcagtttaaaaatttaaggtctttt 

ttgctgctgctacttatgcatattctaatggcaattataaatttagagcaatagatacttggaaattagtagtaga 
tcttgcgccaaggttttctccttatattgctaaatcaagaagtcaaattaaaaattctgtatatttaaaaaaaaat 

TAA ' ■ 

f874.aa - 

^mlksnkwligaggvgssfayaltidnslvhelviidvnenkakgevmdl^gqmflkkninvlfgtykdcanadi 
wSLlnqkpgetrldlvd^ 

SRLRYFLSDHFNVNTQNIHSYIMGEHXDSSFATWDETKIAMKPLSEYLAEGKITELELDEiHKKVVNAAYtiVIKLK 
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GATYYAIGLGIKNIVNAIIGDQNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSA 
NQLKSYIDKMEF 



t874.aa _ 

ALTIDNSLVHELVIIDVNENKAKGEVMDLNHGQMFLKKNINVLFGTYKDCANADIWITAGLNQKPGETRLDLVDK 
NSKIFKDIITNWSSGFDGIFWASNPVDIMTYVTMKYSKFPIHKVIGTGTILDTSRLRYFLSDHFNVNTQNIHSY 
IMGEHXDSSFATWDETKIAMKPLSEYLAEGKITELELDEIHKKVVNAAYEVIKLKGATYYAIGLGIKNIVNAIIGD 
" QNVILPISSYINGQYGGLIKDIYIGAPAIVCKEGVKEVLNFKISPKELDKFNSSANQLKSYIDKMEF 

> 

£374.. nt 

ATGCTTAAGTCTAATAAAGTTGTTCTTATTGGAGCTGGTGGGGTTGGTTCAAGCTTTGCGTATGCTTTAACAATAG 
ACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGGTCATGGACCTTAA 
TCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGCTAATGCAGATATT 
GTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAAAATTCTAAAATTT 
TTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCAATCCTGTAGACAT 
TATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGACTATTCTTGATACT 
TCAAGACTTAGAT-ATTTTTTAAGTGATCATTTTAATGTGAACACTCAAAATATACATTCATATATTATGGGTGAGC 
ACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATATCTTGCTGAAGGC 
AAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTTATTAAGTTAAAGG 
GGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATCAGAATGTTATTCT 
GCCAATATGTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGCGCCTGCTATAGTT 
TGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCCTAAAGAGCTTGATAAGTTTAATAGTTCTGGTA 

ATC AGC TT AAAAGC TAT ATTG AT AAAATGG AATTTT AG 
t874.nt 

GCTTTAACAATAGACAATTCACTTGTACATGAACTTGTAATTATTGATGTTAATGAAAATAAAGCAAAAGGGGAGG 
TCATGGACCTTAATCATGGCCAAATGTTTTTAAAGAAGAATATTAATGTATTGTTTGGGACTTACAAAGATTGTGC 
TAATGCAGATATTGTTGTAATTACAGCAGGACTTAATCAAAAGCCTGGTGAGACAAGACTTGATTTGGTTGATAAA 
AATTCTAAAATTTTTAAAGATATTATAACTAATGTTGTATCTAGCGGTTTTGATGGTATTTTTGTTGTTGCAAGCA 
ATCCTGTAGACATTATGACTTATGTTACAATGAAATATTCCAAATTTCCTATTCATAAGGTTATTGGTACTGGGAC 
TATTCTTGATACTTCAAGACTTAGATATTTTTTAAGTGATCATTTTAATGTGAAGACTCAAAATATACATTCATAT 
ATTATGGGTGAGCACGTGACAGTTCTTTTGCTACGTGGGATGAAACAAAAATAGCAATGAAGCCTTTGTCAGAATA 
TCTTGCTGAAGGCAAAATAACTGAGTTGGAGCTTGATGAAATTCATAAAAAGGTTGTGAATGCTGCTTATGAAGTT 
ATTAAGTTAAAGGGGGCAACCTATTATGCTATTGGACTTGGTATTAAGAATATTGTAAATGCAATAATTGGAGATC 
^GAATGTTATTCTGCCAATATCTTCTTATATTAATGGCCAGTATGGGGGATTGATTAAAGATATTTATATTGGAGC 
GCCTGCTATAGTTTGTAAGGAAGGAGTCAAAGAAGTTTTAAACTTTAAGATAAGCCGTAAAGAGCTTGATAAGTTT 

AATAGTTCTGCTAATC AGC TTAAAAGCTATATTGAT AAAATGG AATTTTAG : 
f88 6.aa 




KITCYDTKDTKRKEEIYDNLNNKIQbJ.tYLiijis.u^iL.c.iAiNi vi^^^-u^.^— — . ^^-^-pr 

TGSNFQIWNYGINGDIKSTYFDIKKATTKVIKYDDKKRNSNSTIIVNNKIKSKEKNQYLDEEKIVNTFEEENTKII 

STYKANNLIKEETYKNNELIKVNDFQYNESDMIIFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDIIYLKT 
EYHNDNEYEEEIYYNKKPALRVKHKNGKVTEEKPIGTN , . 

t886.aa - 

SYFASDVFFNKYQKLNEKPKTGFYIEYYSV^DTSKLYLYKENNLIKYKTIQIIENTKKITCYDTKDT^EEIYDN 
LNNTCIQEIEYDSKGKTLETANY^ENF^LISKNLKTINQKPKLIYYSKDDNGKLLKITGSNFQIWNYGINGmKST 
YFDIKKATTKVIKYDDKKRNSNSTIIVNNKIKSKEKMQYLDEEKIVNTFEEENTKIISTYKANNLIKEETY^^EL 
IKVNDFQYNESDMIIFQNTKEKDKDQYTNTKIEYEYNKDNQLKSKKIYENDIIYLKTEYKtIDNEYEEEJ.YYNKj\PA 

LRVKHKNGKVTEEKPIGTN 
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f886.nt 

ATGAAAAAAAAACAATTAATACTTCTTCTATTTATGCCACAAATTATTTATGCAAAAAGCTATTTTGCATCTGATG 
TATTTTTCA^TAAATACCAAAAATTAAATGAAAAACCAAAAACGGGGTTTTATATTGAGTATTATTCTGTTGATGA 
TACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCAAATCATAGAAAACACAAAA 
AAAATTACATGTTATGATACAAAAGATACAAAAAGAAAAGAAGAGATTTACGATAATTTAAATAACAAAATACAAG 
AAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATTACGTTTATGAAAACGAAAACTTAATATGTAA 
AAATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAATGGTAAATTACTAAAAATA 
ACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAATGGCGACATAAAATCTACATATTTTGACATCAAAAAAG 
CAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTACAATAATTGTTAATAATAAAATAAA 
ATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAATAGTAAATACCTTTGAAGAAGAGAATACAAAAATCATA 
TCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTTATAAAAGTAAATGATTTTC 
AATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACCAATACACCAATACTAAAAT 
TGAATACGAATATAACAAAGACAATCAATTAAAAAGCAAAAAAATTTATGAGAACGATATAATTTATCTAAAAACT 

GAATACCACAATGACAATGAATA' 

ACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA 
t886.nt 



.TGAAGAAGAAATATACTACAATAAAAAACCTGCTCTTAGGGTAAAACACAAGA 



AGCTATTTTGCATCTGATGTATTTTTCAATAAATACCAAAAATTAAATGAAAAACCAAAAACGGGGTTTTATATTG. 
AGTATTATTCTGTTGATGATACTGAAAAACTCTACCTATACAAAGAAAATAACTTAATAAAATACAAAACAATTCA 
AATCATAGAAAACACAAAAAAAATTACATGTTATGATACAAAAGATACAAAAAGAAAAGAAGAGATTTACGATAAT 
TTAAATAACAAAATACAAGAAATTGAATATGATAGCAAAGGAAAAACTCTTGAAACAGCAAATTACGTTTATGAAA 
ACGAAAACTTAATATCTAAAAATTTAAAAACAATAAACCAAAAACCAAAATTAATATATTATTCTAAAGACGACAA 
TGGTAAATTACTAAAAATAACAGGATCAAATTTCCAAATTTGGAACTATGGAATTAATGGCGACATAAAATCTACA 
TATTTTGACATCAAAAAAGCAACAACAAAAGTTATAAAATATGATGATAAAAAAAGAAATTCAAACAGTACAATAA 
TTGTTAATAATAAAATAAAATCCAAAGAAAAAAACCAATATTTAGATGAAGAAAAAATAGTAAATACCTTTGAAGA 
- AGAGAATACAAAAATCATATCTACCTACAAGGCAAACAACCTAATTAAAGAAGAAACATATAAAAATAATGAACTT 
ATAAAAGTAAATGATTTTCAATACAACGAATCTGATATGATAATTTTTCAAAACACTAAAGAAAAGGATAAAGACC 
AATACACCAATACTAAAATTGAATACGAATATAACAAAGACAATCAATTAAAAAGCAAAAAAATTTATGAGAACGA 
TATAATTTATCTAAAAACTGAATACCACAATGACAATGAATATGAAGAAGAAATATACTACAATAAAAAACCTGCT 
CTTAGGGTAAAACACAAGAACGGAAAAGTCACCGAAGAAAAACCAATAGGAACAAATTAA ; 

f888.aa ' 

MEKLKLKLAIPLLVFTICKIHSQSNIEYNFSYIINTKKENIDLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIY.I 
NQDIIKTELEISKLKKEMDKKKLQNIITAKEKHNTKTKIDELKKNIQNINNKQKKFAEYFNNLKKLKVKYKKIEEQ 
TNISNI^KEFFIRkELFFINYIDLKKIENYYLLEISNITPEKIETKKAVFKTSSSVNEIADHITCTSLrai^F 
LKININVKNNSDAKIYINEKFVSKGIYHDNIFDISKLPNKEIEIQITSANFENYSIKRTVKNADSIILDIDLKRTI 
SKKVSIKSNVQSKVFKKGIFMGETPIEIEKPENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKNRLIDTRDK 
FYVNLAVFTLSTIGAIFAGTLLNNSEVLYKI'TGNHFINKRLTAEDVYMAKAEQMTATFLFGVGITLTIGSFISLIT 

HL VEY I KEANMGE 
t8 8 8.aa 

• SNIEYNFSYIINTKiCENlbLKKGIEKQLDKIYDKITEHIVNNDDKSIIEDIYINQDIIKTELEISKLKKEMDKI^ti 
QNIITAKEKHNTKTKIDELKKNIQNIN^WQKKFAEYF^INLKKLKVKYKKIEEQTNISNLNKEFFIREELFFIi^lD 
LKKIENYVLLEISNITPEKIETKKAVFKTSSSVNEIADHITKYSLKEILGREFLKININ^/KMNSDAKIflNEKFVS 

' KGIYHDNIFDISKLPNKEIEIQITSANFENYSIKRTVKNADSIILDIDLKRTISKKVS'IKSNVQSKVFKKGIFMGE 
TPIEIEK^ENQDIILLKSKGYKDKFKLINKEEDQVEIEMIKTNKNRLIDTRDKFYVNLAVFTLSTIGAIFAGTLLN 
NSEVLYKITGNHFINKRLTAEDVYMAKAEQMTATFLFGVGITLTIGSFISLITHLVEYIKEANMGE 

f888.nt 
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ATrGAAAAGCTTAAACTAAAGCTAGCAATACCATTGCTAGTATTTACAATATGCAAAATACATTCTCAAAGTAATA 

JJgaa^aSt^ 
:aag 



. GGAC 

^a^^^^TPTArAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAAATCGAAGAGCAA 

^gaaaIS^tItttgctagaaattagcaacatcactcctgagaaaattgagactaa^ 



^G^ATCATCTTCTGTTAATGAAATTGCAGATCACATAACAAAATACAGCCTCAAAGAAATATTGGGCAGAGAATTT 
^ CA ^™T™SI^rTCAAA^^ 

^^cga^aa?^ 

TTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAGGAGCCATTTTTGCAGGAACATTGCTTAACAATTCAG 

' IIgScSta^ Iaaataacaggcaatcac™^ 

CATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG ' 
t888 .nt 

agtaatattgaatacaatttttcctatatcattaatacaaaaaaagaaaatattgacctaaaaaa 

JGACAAAATCTATGATAAAATAACAGJ 

c^cat^cS 

TTAACAATAAA^AAAAAAAATTTGCA^ 

.gaISt^Itttgctagaaattagcaacatcactcctgagaaaattgagactaaa^ 



rTGGACAAAATCTATGATAAAATAACAGAACATATAGTAAACAATGATGACAAGAGCATCATTGAAGACAT 

ttatataaatcaagatataataaaaacagaacttgaaattagc 



AACAATa 



3CAAATTAAAAAAAGAAATGGATAAAAAAAAACTT 



AGAGCAAAC; 



AACAATCTC 
ACCGCAATTG 



AAAGTTTTT AAAAAAGGAAT ATTT ATGGGAGAA 



^TAAACAAAAAAAATTTGCAGAATATTTTAACAATTTAAAAAAACTAAAAGTAAAATATAAAAAAATCGA 

\ACAAATATATCAAATTTAAATAAAGAATTTTTTATAf 
CTTAAAAAAATAGAAAATTATTATTTGCTAGAAATTAGCAACAl^^^----^. _ 

VATCTATCACGATAATATTTTTGACATTTCTAAACTCCCA 
CTCGAAAACTATTCTATTAAAAGAACGGTAAAAAATGCAG, 
T C T C T AAAAAAGT ATC AATT AAAAGC AATGT AC AAT C T AA 
\ATTGAAATTGAAAAACCAGAAAATCAAGATATCATCTTG 
\TAAATAAAGAAGAAGATCAAGTAGAAATAGAAATGATAA 

CAAAAGCGGAACAAATGACTGCAACATTTCTATTTGGAGTAGGAATCACTTTAACTATTGGAAGCTTTATCTCATT 
AATAACTCATTTAGTAGAATATATTAAAGAAGCAAATATGGGAGAATAG 

f893.aa 



^aSc^cga^a^ 

CAAAT^CGAAAACTA^TCTATTAAAAGAACGGTAAAAAATGCAGAC 

TCTAAi 
TTGAAi 
AAATAi 

AGATAAATTTTATGTCAATCTGGCCGTCTTTACATTAAGCACAATAG* 



AGTT 



rGAGCCATTTTTGCAGGAACATTGCTTAAC 



MVRFLGFLYLITTIPLIKSCDAAQFGDYKPLYFENENDLKTANEYI^ 

jRKSIFLKKLSNLFNIEHKKLLYVENRFKSINFKNLKKELNINADI^ 
LLIFLDPTNSIFTLIFLLISSIxAFMISKEIMYFYPFTVLSYLLFLIISNFJtf^ 

TFTFTALYFITTTTFFTTNIDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAxrt 



INKLNNLDLRKSIFLKKLSNLFNIEHKKLLYVENRFKSINFKNLKKELNINi^^^SLiDYKTKINFI 

LLIFLDPT] 
LFTFTALY1 

£kip^ke^f™ie^^ 

fslpffvllflfkairftillnin 

EKTYKKYIQG 

t393.aa - • 

CDAAQFGDYXFLYFENENDLKTANEYINSLGYKTISEYTTKIDILDFPENK^I 

NLFNIEHXKLLYVENRFKSINFKNLKKELNINADIHSLDYKTKINFISSxIrL.xIIIuLIrLDPTNSIFTLIFLLI 
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SSL^FMISKEIMYFYPFTVLSYLLFLIISNFNKNYNKIYLKEINFLTLMTKIKHLLFLFTFTALYFITITTFFTTN- 
IDPTFIAFVAIPTLCIFLIFSWIKTESNFKDTFLFPIEIKEKKIEGKKALKSKIAIHLLLFTLSLIPFAYSSYMLN 
SYENII^LYSKKLI^FDYLNPNNIYIMLGYNKDMPNIIGYLSHILYQNELKYNlTAiCYGKIPKDIKENYFEIKNDK 
XEIKPKTVYEVDKSFIDEILKKDLASLFLKNKNPILIYKENKNNINTDKKNYKILFFFSLPFFVLLFLFKAIRFTI 

LLNINEKTYKKYIQG - 



f893.nt 

ATGGTGCGTTTTTTAGGTTTTTTATATTTAATTACAACAATACCACTTATCAAATCCTGTGATGCAGCTCAATTTG 

GAGACTACAJ^CCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGCCAATGAATATATAAATTCACTAGGATA 

CAAAACAATCTCAGAATACACAACAAAAATTGACATTTTAGACTTTCCCGAAAATAAAGAAATGACAATAAATGAG 

ATAAACAAACTTAACAATCTTGACCTGAGAAAAAGCATATTTTTAAAAAAGCTCTCCAATCTTTTCAACATAGAGC 

ACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTtTAAAAACCTAAAAAAAGAACTCAATATTAA 

TGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTTTATTTCAAGCATAATATTTCTAATCATAATAATT 

TTATTAATTTTTTTAGACCCAACAAACTCTATATTTACTTTAATTTTTCTATTAATTTCATCTCTTGCTTTTATGA 

TAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTCTCTTATTTGTTATTTTTAATAATCAGTAATTTTAA 

CAAAAATTACAATAAi^TATATTTAAAAGAAATAAATTTTTTAACACTAATGACAAAAATAAA^ 

TTATTTACATTCACAGCTCTATATTTCATTACAATCACAACCTTTTTTACTACAAATATTGATCCCACTTTTATTG 

CATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTCAGCTGGATAAAAACAGAAAGCAATTTTAAAGACAC 

TTTC TTATTC C c AATCG AG ATT AAAG AGAAAAAAAT AGAAGG AAAAAAAGC TTT AAAATC AAAAAT AGC AAT AC AT 

CTACTACTATTTACACTCTCATTAATTCCTTTCGCTTATTCAAGCTATATGCTAAATTCTTATGAAAACATTAACT 

ACCTTTACAGTAAAAAATTAAATTACTTTGATTATTTAAATCCTAATAACATTTATATAATGCTGGGATACAACAA 

AGACATGCCCAATATTATAGGGTACCTATCCCACATTCTTTATCAAAACGAACTAAAATACAATATTACCGCTAAG 

TATGGAAAAATTCCTAAAGATATAAAAGAAAATTACTTTGAAATCAAAiVACGACAAAATAGAAATTCATCCTAAAA 

CTGTTTACG AAGTAGACAAATCATTTATTGATGAAATTCTTAAAAAAGATCTTGCAAGTCTGTTTTTAAAAAATAA 

AAATCCAATCCTAATATATAAAGAAAACAAGAATAATATCAACACAGATAAAAAAAATTACAAAATACTTTTCTTT 

TTCTCTTTGCCCTTCTTTGTATTACTATTCCTATTTAAAGCAATAAGATTTACAATTCTTTTAAACATAAATGAAA 

AAAC G T AT AAAAAAT AT ATTC AAGG AT AA 



t893.nt 

TGTGATGCAGCTCAATTTGGAGACTACAAACCTTTATACTTTGAAAATGAAAATGATCTAAAAACTGCCAATGAAT 

ATATAAATTCACTAGGATACAAAACAATCTCAGAATACACAACAAAAATTGACATTTTAGACTTTCCCGAAAATAA 

AGAAATCACAATAAATGAGATAAACAAACTTAACAATCTTGACCTGAGAAAAAGCATATTTTTAAAAAAGCTCTCC 

AATCTTTTCAACATAGAGCACAAAAAACTTCTTTATGTTGAAAACAGGTTTAAAAGTATAAATTTTAAAAACCTAA 

AAAAAGAACTCAATATTAATGCCGACATACATTCTCTTGACTACAAAACAAAAATTAATTTTATTTCAAGCATAAT 

ATTTCTAATCATAATAATTTTATTAATTTTTTTAGACCCAACAAACTCTATATTTACTTTAATTTTTCTATTAATT 

TCATCTCTTGCTTTTATGATAAGCAAAGAAATAATGTATTTTTATCCATTTACAGTTCTCTCTTATTTGTTATTTT 

TAATAATCAGTAATTTTAACAAAAATTACAATAAAATATATTTAAAAGAAATAAATTTTTTAACACTAATGACAAA 

AATAAAACACTTACTATTTTTATTTACATTCACAGCTCTATATTTCATTACAATCACAACCTTTTTTACTACAAAT 

ATTGATCCCACTTTTATTGCATTTGTCGCAATACCAACCCTTTGCATTTTCTTAATTTTCAGCTGGATAAAAACAG 

AAAGCAATTTTAAAGACACTTTCTTATTCCCAATCGAGATTAAAGAGAAAAAAAT AGAAGG AAAAAAAGC TTTAAA^ 

ATCAAAAATAGCAATACATCTACTACTATTTACACTCTCATTAATTCCTTTCGCTTATTCAAGCTATATGCTAAAT 

TCTTATGAAAACATTAACTACCTTTACAGT^JVAAATTAAATTACTTTGATTATTTAAATCCTAATAACATTTATA 

TAATGCTGGGATACAACAAAGACATGCCCAATATTATAGGGTACCTATCCCACATTCTTTATCAAAACGAACTAAA 

ATACAATATTACCGCTAAGTATGGAAAAATTCCTAAAGATATAAAAGAAAATTACTTTGAAATCAAAAACGACAAJ\ 

ATAGAAATTCATCCTAAAACTGTTTACGAAGTAGACAAATCATTTATTGATGAAATTCTTAAAAAAGATCTTGCAA 

GTCTGTTTTTAAAAAATAAAAATCCAATCCTAATATATAAAGAAAACAAGAATAATATCAACACAGATAAAA 

TTACAAAATACTTTTCTTTTTCTCTTTGCCCTTCTTTGTATTACTATTCCTATTTAAAGCAATAAGATTTACAATT 

CTTTTAAACATAAATGAAAAiVACCTAT AAAAAAT AT ATTC AAGGATAA 
f895.aa 

MIRALLTNDLFLSCLVSGISAQVIKYGIQTVKTRKLKLTPVfiLLKKIFLETGGMPSSHSSTVTALSTSIALTEGID 
TNFIIAliAFALITIRDSFGVRYMSGVQAEYLNALSEKLKKEIKIDTTKIKVVKGHKKKEVLTGII J.GIVSAYIVCY 



^pr i8i 

TABLE 1. Nucleotide and Amino Acid Sequences 



tS95.aa 

AQVIKYGIQTVKTRKLKLTPVTiLLKKIFLETGGMPSSHS^STVTALSTSIALTEGIDTNFIIAIjAFALITIRDSFGV 
RYMSGVQAEYLNALSEKLKKEIKIDTTKIKWKGHKKKEVLTGIIIGIVSAYIVCYF 

£895. nt 

ATGATAAGGGCATTGCTTACCAATGATCTTTTTTTGTCTTGTCTTGTATCAGGAATTTCTGCTCAAGTGATTAAAT 
ATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTCCAGTACATCTTTTAAAAAAAATTTTTCTAGAAAC 
AGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACTAACTGAAGGAATAGAT 
ACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTAAGATATATGTCTGGAG 
TTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGA^ 

GGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTCTGCGTATATTGTGTGCTAT 
TTTTAG 

t895.nt - 

GCTCAAGTGATTAAATATGGTATCCAAACTGTAAAAACAAGAAAGTTAAAACTAACTCCAGTACATCTTTTAAAAA 
AAATTTTTCTAGAAACAGGAGGCATGCCAAGTAGTCATTCATCAACGGTCACCGCTCTTTCAACCTCAATCGCACT 
AACTGAAGGAATAGATACAAATTTTATAATAGCTCTTGCATTTGCCCTTATTACAATAAGAGATTCTTTCGGCGTA 
AGATATATGTCTGGAGTTCAAGCAGAATATTTAAATGCATTATCAGAAAAATTAAAAAAAGAAATAAAAATTGACA 
CAACAAAAATAAAAGTGGTCAAGGGGCACAAAAAGAAAGAGGTTCTAACGGGCATAATAATAGGAATAGTCTC^ 

GTATATTGTGTGCTATTTTTAG 
£605. aa 

MYIGAAGKSFSIIIDSAFLSNCFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKYVCGSKYSPMRPTLII 
SKLPVFLLLVRTGQFSLVSIRLIFRIFFHWFZ 

t605.aa . 

CFLFIGSFSRSDSLMSLSNSRFEYPYDASCEFSLVNIVKWCGSKYSPMRPTLIISKLPVFLLLVRTGQFSLVSIR 
LIFRIFFHWFZ 

f605.nt 

ATGTATATTGGTGCAGCAGGAAAATCTTTTTCAATTATTATTGATTCTGCTTTTCTGAGTAATTGTTTTCTTTTTA 
TAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGTATGATGCAAGTTG 
TGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAATGCGTCCAACTCTTATTATT 
TCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGATTGATATTTAGAA 

TTTTTTTCCATTGGTTTTGA ^ 
t605.nt 

TGTTTTCTTTTTATAGGATCTTTTTCAAGATCTGATTCTCTGATGAGTTTGTCAAATTCTAGGTTTGAATATCCGT 
ATGATGCAAGTTGTGAATTTTCTCTTGTGAATATAGTAAAGTATGTGTGTGGATCTAAATATTCCCCAATGCGTCC 
AACTCTTATTATTTCAAAATTGCCAGTATTTCTGCTGTTGGTAAGAACAGGCCAATTTTCGTTGGTAAGCATAAGA 

TTGATATTTAGAATTTTTTTCCATTGGTTTTGA 
f606.aa 

MKI^RSLFLIIFFLTFLCCIWKERKEG 

AKGWDISSDGTWTFNLREKITWSDGVAITAEGIRKSYLRILNKETGSKYVEMVKSVIKNGQKYFDGQVTDSELGI 
RAIDEKTLEITLESPKPYFIDMLVHQSFIPVPVHVTEKYGQNWTSPENMV^ 

SNEVELEEITFYTTNDSSTAYKMYENEELDAIFGSIPPDLIKNLKLRSDYYSSAVNAIYFYAFNTHIKPLDNVKIR 
KALTuAIDRETLT v KVLDNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLAEAGY?NGNGFPILKLKYNTNEAN 
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KKICEFIQNQWKKNLNIDVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSHNYSNPEY 
NELIKKSDLELDPIKRQDILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDLSQLKLKNKZ 

t606.aa 

CCNNKERKEGVSFKISLGAEPSSLDPQLAEDNVASKMIDTMFRGIVTGDPNTGGNKPGLAKGWDISSDGTVYTFNL 
REKITWSDGVAITAEGIRKSYLRILNKETGSKYVEMVKSVIKNGQKYFDGQVTDSELGIRAIDEKTLEITLESPKP 
YFIDMLVHQSFIPVPVHVTEKYGQNWTSPENMVTSGPFKLKERIPNEKYVFEKNNKYYDSNE.VELEEITFYTTNDS 
STAYKMYENEELDAIFGSIPPDLIKNLKLRSDYYSSAVNAIYFYAFNTHIKPLDNVKIRKALTLAIDRETLTYKVL 
DNGTTPTRRATPNFSSYSYAKSLELFNPEIAKTLLAEAGYPNGNGFPILKLKYNTNEANKKICEFIQNQWKKNLNI 
DVELENEEWTTYLNTKANGNYEIARAGWIGDYADPLTFLSIFTQGYTQFSSHNYSNPEYNELIKKSDLELDPIKRQ 
DILRQAEEIIIEKDFPIAPIYIYGNSYLFRNDKWTGWNTNILERFDL.SQLKLKNKZ 

f60'6.nt 

ATGAAATTACAAAGGTCATTATTTTTAATAATATTTTTTCTAACTTTTCTTTGTTGTAATAACAAGGAAAGAAAAG 
AAGGAGTATCATTTAAAATAAGCTT.GGGAGCAGAGCCAAGCAGTCTTGACCCTCAATTAGCAGAGGATAATGTCGC 
ATCAAAAATGATTGACACAATGTTTAGAGGGATTGTTACAGGAGATCCTAATACAGGGGGAAATAAACCGGGACTT 
GCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTAAGAGAAAAAATCACTTGGAGTGACG 
GAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAAATAAAGAAACTGGCTCAAAGTACGT 
TGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACAAGTGACTGACTCTGAACTTGGAATT 
AGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCTTATTTTATTGATATGTTAGTACACC 
AATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACTGGACAAGCCCCGAAAACATGGTGAC 
AAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTCTTTGAAAAAAATAACAAATACTACGAC 
TCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGCTCAACAGCGTATAAAATGTATGAAA 
ATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAAATCTAAAATTAAGAAGCGACTATTA 
CTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAAACCACTTGACAACGTTAAAATTAGA 
AAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTTGACAACGGGACTACCCCTACAAGAA 
GAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATTATTTAATCCTGAAATTGCAAAAACCCT 
TCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTCCAATTTTAAAATTAAAATACAATACAAACGAAGCAAAT 

AAAAAAATTTGTG AATTT ATTC AAAAC C AATGGAAAAAAAATTT AAAT ATTG ATGTGGAAC TTGAAAAC GAAGAAT 
GGACAACATACTTAAACACTAAGGCAAATGGAAATTATGAAATAGCAAGAGCAGGATGGATAGGCGATTATGCTGA. 
TCCTTTGACATTTTTAAGCATATTCACACAAGGATACACACAATTCTCATCTCATAATTACTCAAACCCAGAATAC 
AACGAACTTATAAAGAAATCCGACCTTGAGCTTGATCCAATAAAAAGACAAGACATTTTAAGACAAGCAGAAGAGA 
TAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAACAGTTACCTTTTCAGAAATGACAAATG 
GACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAAATTAAAAAATAAATAA 

t606.nt 

TGTTGTAATAACAAGGAAAGAAAAGAAGGAGTATCATTTAAAATAAGCTTGGGAGCAGAGCCAAGCAGTCTTGACC 
CTCAATTAGCAGAGGATAATGTCGCATCAAAAATGATTGACACAATGTTTAGAGGGATTGTTACAGGAGATCCTAA 
TACAGGGGGAAATAAACCGGGACTTGCAAAAGGGTGGGATATTTCTTCTGATGGAACAGTTTACACATTTAACCTA 
AGAGAAAAAATCACTTGGAGTGACGGAGTTGCAATCACTGCAGAAGGAATTAGAAAATCTTATCTTAGAATTTTAA 
ATAAAGAAACTGGCTCAAAGTACGTTGAAATGGTTAAATCGGTAATTAAAAATGGTCAAAAATATTTTGATGGACA 
AGTGACTGACTCTGAACTTGGAATTAGAGCGATTGATGAAAAAACATTAGAAATAACACTGGAATCACCAAAACCT 
TATTTTATTGATATGTTAGTACACCAATCATTTATTCCAGTACCAGTTCATGTTACCGAAAAGTATGGACAAAACT 
GGACAAGCCCCGAAAACATGGTGACAAGTGGTCCTTTTAAATTAAAAGAAAGAATTCCTAACGAAAAATATGTCTT 
TGAAAAAAATAACAAATACTACGACTCAAATGAAGTAGAATTAGAAGAGATTACATTTTACACAACAAATGACAGC 
TCAACAGCGTATAAAATGTATGAAAATGAAGAGCTAGATGCAATTTTTGGTTCCATACCCCCAGATCTAATCAAAA 
ATCTAAAATTAAGAAGCGACTATTACTCATCAGCTGTTAATGCCATATACTTTTACGCGTTCAATACACACATCAA 
ACCACTTGACAACGTTAAAATTAGAAAAGCCTTAACTCTTGCTATTGACAGAGAAACGCTTACATATAAAGTTCTT 
GACAACGGGACTACCCCTACAAGAAGAGCAACTCCCAACTTTAGTTCATATTCTTATGCAAAAAGTTTAGAATT.-^^i 
' ^TAATCCTGAAATTGCAAAAACCCTTCTAGCTGAAGCTGGATATCCTAATGGCAATGGATTTGCAATTTTAA.^ri 
AAAATACAATACAAACGAAGCAJVATAAAAAAATTTGTGAATTTATTCAAAACCAATGGAAAAAAAATTTAAATAT 
GATG T GGAACTTGAAAACGAAGAATGGACAACATACTTAAACACTAAGGCAAATGGAAATTATGAAATAGCAAGA^ 
CAGGATGGAT V3GCGATTATGCTGATCCTTTGAC ATTTTTAAGC AT ATTC ACAC AAGGATACACAC AATTCTC A , v. 
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TC ATAATTAC TC AAAC C C AGAATAC AAC GAAC TT AT AAAG AAATCC GAC C TTGAGC TTGATC C AATAAAAAG AC AA 
GACATTTTAAGACAAGCAGAAGAGATAATTATTGAAAAAGATTTTCCAATAGCACCAATATACATATATGGGAACA 
GTTACCTTTTCAGAAATGACAAATGGACAGGGTGGAACACCAATATTTTAGAAAGATTTGATTTATCTCAGCTAAA 
ATTAAAAAATAAATAA 

f679.aa 

MFNRSSCVLQNFLFLFLFLSLVSCFAKKEISGNNFIKAHSKEFDL^JLNWLWNFDYTKKNFDKHFNIDPSSYXYVA 
YliFKKIGFEEKFVEYMKKAIANGDSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQ 
KKNDKALSLLNKLDKMKFSDYQENENILLKAVLYLNLSNVSESKIYFNELFENLPANYLHVRAYDYFIIENKSRYF 
GANFLNLVRFKYEVANGNFNGAINILNKNGLN^^ 

NLREKNNLGLLLLKEYLEGLDLNNEINRLDLLNTAFSNLXFTKSARDYFAESLPKFYTEGDKKNSTFIKILEEYIL 

ESIQLEDYGNLYKLYSNAQKVISNSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLD 

QNIDEFFTGGSDI'KYEQSDYEIFLEGFLKFNLCNYVRGFISEDFRNGYKFSLDFYRKVYDELLKSENYYDATLVIN 

YLVNQDESALMENDYKRLYPYLYGSLIEYWAKRRGL^^ 

KELKYFfcTYDLKIPKDNIIIGTYYLKKRIST^^ 

KKILVYSVFYDALYEKKGIDSVIVKIMGEFPKNZ 

t679 .aa 

CFAKKEISGNNFIKAHSKEFDLNNLNWLWNFDYTKKN^ 

DSIASQFAGIKLIEYFNSAKEYFASELIGEKLYKKYENNKFIILGYFKSLYWQKKNDKALSLLNKLDKMKFSDYQE 
NENILLKAVLYLNLSWSESKIYFNELFENLPANYLHVRAYDYFIIENKSRYFGANFLNLVRFKYEVANGNFNGAI 
NILNKNGLNDYYDNNIVLSDVYKAFISSGKVS^ 

NEINRLDLLNTAFSNLIF.TKSARDYFAESLPKFYTEGDKKNSTFIKILEEYILESIQLEDYGNLYKLYSNAQKVIS 
NSVLSKLAFINARLIYHKLIKPNVSGEYKSLLHSAVNYDKWSYSSFMSRYLLDQNIDEFFTGGSDIKYEQSDYEIF 
LEGFLKFNLCNYWGFISEDFRNGYKFSLDFYRKVYDELLKSENYYD^^ 
GSLIEYWAKRRGLEASVVFSLIKAESSFEKNAVSKPGAVGLM^ 

LKKRISTTGSLYKALASYNGGIGNVRKWEKSYGHLSKELFIEAIPFSQTRNYIKKILVYSVFYDALYEKKGIDSVI 
VKIMGEFPKNZ 

f679.nt 

ATGTTTAATAGAAGTTCTTGTGTATTACAAAATTTTCTTTTTCTTTTTTTATTTTTAAGTTTAGTTTCTTGCTTTG 
CAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAAATTGGTT 
ATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTACATATATGTTGCT 
TATTTATTTAAAAAAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGAGATAGCA 
TTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTGAATTGAT 
TGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTATTGGCAA 
AAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAAAATGAAA 
ATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATGAGCTTTT 
TGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAGGTATTTT 
GGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATAAATATAT 
TAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAGGCTTTTATTAGTTC 
TGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGGTATTCTA 
AACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAACAATGAGA 
TTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATTTTGCCGA 
. AAGTTTAC C C AAGTTTT AT AC CGAGGGC GAT AAAAAAAATTC TAC TTTT ATT AAG ATTTT AGAAGAGT ATA.TTTTG 
GAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCTAATTCTG 
TTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCGGAGAATA 
CAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTTATTAGAT 
CAAJVATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTTTTGGAAG 
GGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTTTAGGAATGGATATAAATTTTC 
ACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGTGATTAAT 
TATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATCCTTATTTGTATGGATCTT 
TGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTCTTTAATAAAAGCAGAGAGTAGCTT 
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-TGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGATATTTCT 
AAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCCAAAAGATAATATAATAATTGGAACATATTATTTAAAAA 
AAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTAGAAAGTG- 
GGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAATTATATT 
AAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAATAGATTCAGTAATAGTTAAAA 
TTATGGGCGAATTCCCCAAAAATTAA 

t679.nt _ ' 

TGGTTTGCAAAAAAAGAAATCTCAGGCAATAATTTTATTAAGGCGCATTCAAAAGAGTTTGATTTAAATAATTTAA 
ATTGGTTATGGAATTTTGATTATACAAAAAAAAATTTTGATAAGCATTTTAACATAGATCCAAGTTCTTACATATA 
TGTTGCTTATTTATTTAAAiWVAATAGGATTTGAAGAGAAATTTGTAGAGTATATGAAAAAGGCCATAGCTAATGGA 
GATAGCATTGCATCCCAGTTTGCTGGGATTAAGCTTATTGAATATTTTAACTCAGCAAAAGAGTATTTTGCATCTG 
AATTGATTGGAGAGAAGCTTTATAAAAAATACGAAAATAATAAATTTATTATACTGGGGTACTTTAAAAGTCTTTA 
TTGGCAAAAGAAAAACGATAAGGCACTTAGTCTTTTAAATAAGCTTGATAAGATGAAATTTTCTGATTATCAGGAA 
AATGAAAATATTTTATTAAAAGCAGTTCTTTACCTTAATCTTTCTAATGTAAGTGAGTCAAAAATTTATTTTAATG 
AGCTTTTTGAGAACTTACCTGCAAATTATTTACATGTAAGAGCTTATGATTATTTTATTATTGAAAATAAGTCTAG 
GTATTTTGGTGCAAATTTTTTAAATCTTGTTAGATTTAAGTATGAAGTGGCAAATGGCAATTTTAATGGTGCAATA 
AATATATTAAATAAAAATGGTTTAAATGATTATTATGACAATAACATTGTATTAAGTGATGTTTATAAGGCTTTTA 
TTAGTTCTGGCAAAGTTTCAAATGCTTTAACATTTTTTAGTAAAATAAAGAGCAAATATAAAAATTATTATTTAGG 
TATTCTAAACCTTAGAGAGAAAAATAATTTAGGACTTCTTCTTTTAAAAGAATATCTTGAAGGTTTAGATCTTAAC 
AATGAGATTAACAGGCTTGATTTGCTTAATACTGCTTTTAGCAATTTAATTTTTACTAAGAGCGCAAGGGATTATT 
TTGCCGAAAGTTTACCCAAGTTTTATACCGAGGGCGATAAAAAAAATTCTACTTTTATTAAGATTTTAGAAGAGTA 
TATTTTGGAATCAATTCAGCTTGAAGACTATGGCAATCTTTATAAGCTTTATTCTAATGCTCAAAAAGTTATTTCT 
AATTCTGTTTTGTCTAAGCTTGCTTTTATTAATGCAAGGCTTATATATCATAAATTAATTAAACCTAACGTAAGCG 
GAGAATACAAGAGTCTTTTGCATTCTGCTGTTAATTATGATAAATGGTCTTATTCTTCATTTATGAGTAGGTACTT 
ATTAGATCAAAATATTGATGAATTTTTTACAGGTGGGTCTGATATTAAGTATGAGCAATCCGATTATGAGATTTTT 
TTGGAAGGGTTTTTAAAATTCAATCTTTGTAATTATGTTAGAGGGTTTATTTCTGAGGATTTTAGGAATGGATATA 
AATTTTCACTTGATTTTTATCGAAAAGTATACGATGAACTTTTAAAGAGTGAAAATTATTACGATGCAACTCTTGT 
GATTAATTATCTTGTAAATCAAGATGAATCTGCTTTAATGGAGAATGACTATAAAAGACTTTATCCTTATTTGTAT 
GGATCTTTGATAGAATATTGGGCTAAAAGGAGAGGGCTTGAAGCTAGTGTTGTATTTTGTTTAATAAAAGCAGAGA 
GTAGCTTTGAAAAAAATGCTGTCTCAAAACCGGGTGCTGTTGGCCTTATGCAGGTTATGCCATCAACAGCAAATGA 
TATTTCTAAAGAACTTAAGTATTTTAACTATGATTTAAAGATTCGAAAAGATAATATAATAATTGGAACATATTAT 
TTAAAAAAAAGAATATCTACAACTGGCAGTCTTTATAAGGCTCTTGCGTCTTATAATGGGGGTATTGGTAATGTTA 
GAAAGTGGGAGAAAAGTTATGGACATTTGTCAAAAGAGCTTTTTATTGAGGCAATTCCCTTTAGTCAAACTAGGAA 
TTATATTAAAAAAATATTAGTTTATTCGGTATTTTATGATGCTTTGTATGAAAAGAAGGGAATAGATTCAGTAATA 
GTTAAAATTATGGGCGAATTCCCCAAAAATTAA 

f 11-12 .nt 

TAAAAGGAGA ATATTTTTAT GAGAAAAAGT TTGTTTTTAT ATGCATTATT AATGGGAGGA 

TTGATGTCTT GTAATCTAGA TTCCAAATTA TCTAGTAACA AAGAACAAAA AAATAACAAT 

AATGTAAAAG AAGTTTCGGA TAGTGTTCAA -GAAGATGGTC TTAATGATTT ATATAATAAT 

CAAGAAAAGC AAAAAAGCTT TACTAAAAAT TTTGGAGAAC GGAAATATGA GGATTTAATT 

AATCCTATAG AGCCTATAAT ACCTTCAGAA TCACCAAAGA ATAAGGCTAA TATACCAAAT 

ATTTCAATTG CGCATACTGA AAAAAAAGAG ACAAAAAAGG AGAATTTAAT CCCTTCTACT 

AATGAAGAAA AGGAAGCTGA TGCAGCAATT AAATATTTAG AAGAAAATAT TCTTAAAAAC 

TCTAAATTTT .CTGAATTAAT TAGAGAAGTA CGTGTAATTA AAGATGAATA TGCTTTAATA 

AAAGCTGATT TGTATGATGT AATTGGAAAG ATTAACAATA AAAAAACATC ATTAATGGAG 

AATCCTAAGA ACAATAGAGA TAAGATAAAT AAATTAACAC AATTGTTGCA AAATAATTTA 

AAGATAGATA GTGAACTTGA GCAGCTTATA AATATGATTG ATATGGCAGA AAATGAAATA 

AGCTCTGCGG CTTTCTTTTT TGACAACGCT CAGAAAAGGT TAAAAGAAAG. CATTATTAAA 

AGATTAGAGA GTAAAAATAA TAGATCTTAT GCATTAAAAT TGTCTAGACA GGCTTTAAGT 

GACGCAAGAA GTGCTTTAAG TAATTTAGAA TCTTTTGCCT CTAAAAGAAT TGAACCAATG 

GTGAGAAAGG AAGAAATAAA AG AGC TT ATT AAACATGCAA AAACTGTTTT AGAAAGTCTC 
AATAAAAAAT AA 
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tll-12.nt 

TTGTAATCTAGATTCCAAATTATCTAGTAACAAAGAACAAAAAAATAACAATAATGTAAAAGAAGTTTCGGATAGT 
GTTCAAGAAGATGGTCTTAATGATTTATATAATAATCAAGAAAAGCAAAAAAGCTTTACTAAAAATTTTGGAGAAC 
GGAAATATGAGGATTTAATTAATCCTATAGAGCCTATAATACCTTCAGAATCACCAAAGAATAAGGCTAATATACC 
AAATATTTCAATTGCGCATACTGAAAAAAAAGAGACAAAAAAGGAGAATTTAATCCCTTCTACTAATGAAGAAAAG 
GAAGCTGATGCAGCAATTAAATATTTAGAAGAAAATATTCTTAAAAACTCTAAATTTTCTGAATTAATTAGAGAAG 
_TACGTGTAATTAAAGATGAATATGCTTTAATAAAAGCTGATTTGTATGATGTAATTGGAAAGATTAACAATAAAAA 
AACATCATTAATGGAGAATCCTAAGAACAATAGAGATAAGATAAATAAATTAACACAATTGTTGCAAAATAATTTA 
AAGATAGATAGTGAACTTGAGCAGCTTATAAATATGATTGATATGGCAGAAAATGAAATAAGCTCTGCGGCTTTCT 
TTTTTGACAACGCTCAGAAAAGGTTAAAAGAAAGCATTATTAAAAGATTAGAGAGTAAAAATAATAGATCTTATGC 
ATTAAAATTGTCTAGACAGGCTTTAAGTGACGCAAGAAGTGCTTTAAGTAATTTAGAATCTTTTGCCTCTAAAAGA 
ATTGAACCAATGGTGAGAAAGGAAGAAATAAAAGAGCTTATTAAACATGCAAAAACTGTTTTAGAAAGTCTCAATA 

AAAAA 
fll-12.aa 

KENIFMRKSL FLYALLMGGL MSCNLDSKLS SNKEQKNNNN VKEVSDSVQE DGLNDLYNNQ 

EKQKSFTKNF GERKYEDLIN 'PIEPIIPSES PKNKANIPNI SIAHTEKKET KKENLIPSTN 

EEKEADAAIK YLEENILKNS KFSELIREVR VIKDEYALIK ADLYDVIGKI NNKKTSLMEN 

PKNNRDKINK LTQLLQNNLK IDSELEQLIN MIDMAENEIS SAAFFFDNAQ KRLKESIIKR 

LESKNNRSYA LKLSRQALSD ARSALSNLES FASKRIEPMV RKEEIKELXK HAKTVLESLN 
KK 

tll-12.aa 

CNLDSKLSSNKEQKNNNN^ 

NI SIAHTEKKETKKENLI P STNEEKEAI?AAI KYLEENI LKNSKFSELIREVRVIKDEYAL I KADLYDVIGKINNKK 
TSLMENPKNNRDKINKLTQLLQNNLKIDSEL^ 

LKLSRQALSDARSALSNLESFASKRIEPMVRKEEIKELIKHAKTVLESLNKK 
fll-4.nt 

TAAAGGAGTT TACAAATGAG TAAACTAATA TTGGCAATAT CTATACTGCT AATAATTTCA 

TGTAAATGGT ATGTAGACAA TACCATTGAT GAAGCAACTG TAGAAAGTAA '. ATCAGCACTA 

ACATCTATTG ATCAAGTATT AGATGAGATA AGTGAAGCCA CAGGCCTAAG TTCGGAAAAA 

ATCACAAAAT TAACTCCGGA AGAGCTAGAA AATTTAGCAA AGGAAGCTCA AGATGACTCT 

•GAAAAATCCA AAAAAGAAAT TGAAGATCAA AAAAATACCA AGGAAAGTAA AAACATAGAA 

GTAAAGGATA CTCCTCGCTT AATCAAATTG ATAAAGAATT CATCAGAAAA AATTGATTCG 

GTTTTTCAAA CACTAATTAA TATAGGTTAT AATGCTACCT ATGCAGCCAA AAGTAATTTG 

AAGAATGGAC TAAAGATGGT GAAATTACTG GATGAGTTGC TAAAAATATC GGTAAGTAGC 

. AATGGTGATA AAAGTACCCA AAAAT AC AAT GAACTTAAAA CCGTTGTAAA TAAGTTTAAT 

GCTGAAAATT CGGTAAGCGT TTCTTTTAAA GAACATTCAA ACAGTAAAAT TGAAACTAAA 

AAATGTATTC AAACTCTTAT GAAAAATGTA GAAACATACT TTGAAGGTGT ATGCAGCGAA 

CTTAAAAACA AAAATGATGG TGAGTACGAA AAAACATTGA CAACTTTAAG CTAA 

' tll-4.nt 

ATGTAAATGGTATGTAGACAATACCATTGATGAAGCAACTGTAGAAAGTAAATCAGCACTAACATCTATTGATCAA 
GTATT AGATGAGAT AAGTGAAGC C AC AGGCCT AAGTTCGGAAAAAATC AC AAAATT AAC TC CGGAAGAGCTAG AAA 
ATTT AGC AAAGG AAGC TC AAGATG ACTC TG AAAAATCC AAAAAAGAAATTGAAGATC AAAAAAAT AC C AAGGAAAG 
TAAAAACATAGAAGTAAAGGATACTCCTCGCTTAATCAAATTGATAAAGAATTCATCAGAAAAAATTGATTCGGTT 
TTTCAAACACTAATTAATATAGGTTATAATGCTACCTATGCAGCCAAAAGTAATTTGAAGAATGGACTAAAGATGG 
TGAAATT AC TGGATGAGTTGC T AAAAATATCGGT AAGT AGC AATGGTGAT AAAAGT ACC C AAAAAT AC AATGAAC T 
TAAAACCGTTGTAAATAAGTTTAATGCTGAAAATTCGGTAAGCGTTTCTTTTAAAGAACATTCAAACAGTAAA 
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GAAACTAAAAAATGTATTCAAACTCTTATGAAAAATGTAGAAACATACTTTGAAGGTGTATGCAGCGAACTTAAAA 
ACAAAAATGATGGTGAGTACGAAAAA 

fll-4-aa 

RSLQMSKLIL AISILLIISC KWYVDNTIDE ATVESKSALT SIDQVLDEIS EATGLSSEKI 

TKLTPEELEN LAKEAQDDSE KSKKEIEDQK NTKESKNIEV KDTPRLIKLI KNSSEKIDSV 

FQTLINIGYN ATYAAKSNLK NGLKMVKLLD ELLKISVSSN GDKSTQKYNE LKTWNKFNA 

ENSVSVSFKE HSNSKIETKK CIQTLMKNVE TYFEGVCSEL KNKNDGEYEK TLTTLS 



tll-4.aa 

CKWYVDNTIDEATVESKSALTSIDQVLDEISEATGLSSEKITKLTPEELENLAKEAQDDSEKSKKEIEDQKNTKES . 
KNIEVKDTPRLIKLIKNSSEKIDSVFQTLINIGYNAT^^ 

KTWNKFNAENS VS VS F KEHSNSKI ETKKC I QTLMKNVETYFEGVC S ELKNKNDGE YEK 
f 112-1. nt 

TGAATCTCTA AAGATTTTAG CAGGGGAGAA AATATGAAAA AAAGTTTTTT ATC AAT AT AC 
ATGTTAATTT CAATAAGTTT ATTATCATGT GATGTTAGTA GATTAAATCA GAGAAATATT 
AATGAGCTTA AAATTTTTGT TGAAAAGGCC AAGTATTATT CTATAAAATT AGAC GCTATT 
TATAACGAAT GTACAGGAGC . ATATAATGAT ATTATGACTT ATTCGGAAGG TACATTTTCT 
GATCAAAGTA AGGTTAATCA AGCTATATCT ATATTTAAAA AAGACAATAA AATTGTTAAT 
AAGTTTAAGG AGCTTGAAAA GAT T AT AG AA GAATACAAAC CTATGTTTTT AAGTAAATTA 
ATTGATGATT TTGCGGGATC CGTT " ' 

tll2-l.nt 

ATGTGATGTTAGTAGATTAAATCAGAGAAATATTAATGAGCTTAAAATTTTTGTTGAAAAGGCCAAGTATTATTCT 
ATAAAATTAGACGCTATTTATAACGAATGTACAGGAGCATATAATGATATTATGACTTATTCGGAAGGTACATTTT 
CTGATCAAAGTAAGGTTAATCAAGCTATATCTATATTTAAAAAAGACAATAAAXTTGTTAAT^ 
TGAAAAGATTATAGAAGAATACAAACCTATGTTTTTAAGTAAATTAATTGATGATTTT 

fll2-l.aa 

ISKDFSRGEN MKKSFLSIYM LISISLLSCD VSRLNQRNIN ELKIFVEKAK - YYSIKLDAIY 
NECTGA YND I MTYSEGTFSD QSKVNQAISI FKKDNKIVNK FKELEKIIEE YKPMFLSKLI 
DDFAGSV - 

tl!2-l.aa 

CDVSRLNQRNINELKIFVEKAKYYSIKLDAIYNECTGAYNDIMTYSEGTFSDQSKWQAISIFKKDNKIVNKFKEL 
EKI I EEYKPMFLSKL IDDF 

fl4-8.nt 

TAAATACAGA. GCCATTCAAG GAGAGTATTT ATGAAATACT ATATATGTGT GTGTGTTTTT 

TTGCTTTTGA ATGCTTGCAA TTCAGATTTT AGCACTAATC AAGAAGATAT TAAATATCCA 

TCTGATAAAG AGAAATCAAA ATCCAACATG GAAGCAAGCT CTAAAGAAGA AGATCCAAAT 

AAAAAAATAA AAAAT AC AC T GCTTAATGAT TTAATAAATT TGATAGAAAT AGCTAATGAG 

CATAAAGAAA 'AATATGAAAA AAGAATGCAA GAAGAACCTT CAGATCAATA CGGAATATTG 

GCTTTCCAGG AATTAGACTT GTCCGTTGGA AAAAT ATC TG AAGACACCCC GCAATCTAAA 

AAATTTAGAA AAAAC AC C T A TTCTCCCTTA AGCGCTATTG ATGTCAATAA ATTAAAAGAT 

CTTTCAGAGA TTATAAGAAA TTCGGGCCAA ATACAAGGTT TATTTAATAT TTTCAACAGA 

TTCGGAGGCA TTTTTGACGA CTCACTTAAT CACGTATATT CTAAAAAAGA TATCCTAGGG 

GGACTAGAAA TTTTGGATTT AGATAAACTA AAAAATTCGT TTGAAAAATT AC T ATC TATA 
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AAAGAAACTT TCTCAAAAAT GCTAAATCAA CTTTTATTAG- ATTATAAAAA TGATAAAGAT 
CATATACGAA CAGAGACAAA TAAACTTAAA TCTCATACAA CTGCACTTTT CGAACAACTT 
GATAAAAAAG AAGACGAAGC ATATGAACCT AAAAATCAGA TATTTTCAAT AAGTAACCTT 

TAA 
• 

tl4-8.nf 

TTGCAATTCAGATTTTAGCACTAATCAAGAAGATATTAAATATCCATCTGATAAAGAGAAATCAAAATCCAACATG 
GAAGCAAGCTCTAAAGAAGAAGATCCAAATAAAAAAATAAAAAATACACTGCTTAATGATTTAATAAATTTGATAG 
AAATAGCTAATGAGCATAAAGAAAAATATGAAAAAAGAATGCAAGAAGAACCTTCAGATCAATACGGAATATTGGC 
TTTCCAGGAATTAGACTTGTCCGTTGGAAAAATATCTGAAGACACCCCGCAATCTAAAAAATTTAGAAAAAACACC 
TATTCTCCCTTAAGCGCTATTGATGTCAATAAATTAAAAGATCTTTCAGAGATTATAAGAAATTCGGGCCAAATAC 
AAGGTTTATTTAATATTTTCAACAGATTCGGAGGCATTTTTGACGACTCACTTAATCACGTA-TATTCTAAAAAAGA 
TATCCTAGGGGGACTAGAAATTTTGGATTTAGATAAACTAAAAAATTCGTTTGAAAAATTACTATCTATAAAAGAA 
ACTTTCTCAAAAATGCTAAATCAACTTTTATTAGATTATAAAAATGATAAAGATCATATACGAACAGAGACAAATA 
AACTTAAATCTCATACAACTGCACTTTTCGAACAACTTGATAAAAAAGAAGACGAAGCATATGAACCTAAAAATCA 

G 

' f 14-8 -aa 

IQSKSRRVFM KYYICVCVFL LLNACNSDFS TNQEDIKYPS DKEKSKSNME ASSKEEDPNK 
KIKNTLLNDL INLIEIANEH KEKYEKRMQE EPSDQYGILA FQELDLSVGK ISEDTPQSKK . 
FRKMTYSPLS AIDVNKLKDL SEIIRNSGQI QGLFNIFNRF * GGIFDDSLNK VYSKKDILGG 
LEILDLDKLK NSFEKLLSIK ETFSKMLNQL LLDYKNDKDH IRTETNKLKS HTTALFEQLD 
KKEDEAYEPK NQIFSISNL 



tl4-8.aa 

CNSDFSTNQEDIKYPSDKEKSKSNMEASSKEEDPNKKIKNTLLNDLINLIEIANEHKEKYEKRMQEEPSDQYGILA 
FQELDLSVGKISEDTPQSKKFRKNTYSPLSAIDVNKLKDLSEIIRNSGQIQGLFNIFNRFGGIFDDSLNHVYSKKD 
ILGGLEILDLDKLKNSFEKLLSIKETFSKMLNQLLLDYKNDKDHIRTETNKLKSHTTALFEQLDKKEDEAYEPKNQ 

f 17-6 .nt _ 

TAAAGGAGGG TATTTATGAA ATACCACATA ATTACAACTA TATTTGTTTT • TCTGTTTTTA 

GCTTGCAGGC CGGATTTTAA TATCGATCAA AAAGACATTA AATACCCGCC TACTGAAAAA 

TCAAGGCCCA AAACTGAAAG CTCTAAGCAA AAAGAATCAA AGCCTAAAAC AGAAGAAGAG 

CTTAAGAAAA AACAACAAGA AGAAGAGCTT AAGAAAAAAC AACAAGAAGA AGAGCTTAAG ■ 

AAAAAACAAC AAGAAGAAGA GCTTAAGAAA AAACAACAAG AAGAAGAGAA GGAAGAACTA . 

AGAAAACAAC- AACTAAAAAA TACGCTATCT AATGATTTAA AAAAGCAAAT AGAATCGGCC 

TACAATTTTA AAGAAAAATA- TGTAAAAAGT ATGGAAAAAG AACCTGAAGA CCATTACGGG 

ATGACGTCTT TTAGGGGATT GAATTGGGGG CCAGGGACTG " AAGATATATC TGACAATACC 

G AAAGATC T A * TAAGAT AT AG AAGACACACT TAT AC TGTTT TAAGCCCCCT GGATCCTCAT 

GAATTAAAGG AATTCGCAAA TATTATTCAA GATATAAATA AACTAGCATC AGTAGCAAGT 

ATATTTAATT CTTTTAGCGC TATTGGAGGA GCTCTTGACA TAGTAAGTGA TCACCTATAT 

TTCAAAAAAG ACAATCTAGA CAAACTAGAT ATTGCAGATT TAGAAATACT TAAAAATTpA 

TTTGAACAAA TATTATATAT AAAAGGAAGT GTTGCAGGAA AAGCAAAAAA ACTTTTATTA 

G ATT AT AAAA ATC T AAAAAC AGATATTAAT AAGCTTAAAT CTTATTCAAA TGAACTGGTT 

AATGGAATTA. AGCAACAAGC TCTAGAAGCA GAAAATCTAG AAGAGCTTAT AGTGTCAAAA 
TATAAACTTT AA 

tl7-6.nf 

TTGCAGGCCGGATTTTAATATCGATCAAAAAGACATTAA^JTACCCGCCTACTGAAAAATCAJ^GGCCCAAAACTG^ 
AGCTCTAJ^GCAAAAAGAATCAAAGCCTAAAACAGAAGAAGA.GCTTAAGAAJ^AAACAA 
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AAAAACAACAAGAAGAAGAGCTTAAGAAAAAACAAC 
GGAAGAACTAAGAAAACAACAACTAAAAAATACGCTA 

TTTAAAGAAAAATATGTAAAAAGTATGGAAAAAGAACCTGAAGACCATTACGGGATGACGTCTTTTAGGGGATTGA 
ATTGGGGGCC AGGGACTG AAGAT AT ATC TGAC AAT AC CG AAAGATC TAT AAGAT AT AGAAG AC AC AC TT AT AC TGT 
TTTAAGCCCCCTGGATCCTCATGAATTAAAGGAATTCGCAAATATTATTCAAGATATAAATAAACTAGCATCAGTA 
GCAAGTATATTTAATTCTTTTAGCGCTATTGGAGGAGCTCTTGACATAGTAAGTGATCACCTATATTTCAAAAAAG 
ACAATCTAGACAAACTAGATATTGCAGATTTAGAAATACTTAAAAATTCATTTGAACAAATATTATATATAAAAGG 
AAGTGTTGCAGGAAAAGCAAAAAAACTTTTATTAGATTATAAAAATCTAAAAACAGATATTAATAAG 
TATTCAAATGAACTGGTTAATGGAATTAAGCAACAAGCTCTAGAAGCAGAAAATCTAGAAGAGCTTATAGTGTCAA 
AAT AT AAAC TT 

£17-6 /aa 

RRVFMKYH I I TTIFVFLFLA CRPDFNIDQK DIKYPPTEKS RPKTESSKQK ESKPKTEEEL 
KKKQQEEELK KKQQEEELKK KQQEEELKKK QQEEEKEELR KQQLKNTLSN DLKKQIESAY 
NFKEKYVKSM EKEPEDHYGM TSFRGLNWGP GTEDISDNTE RSIRYRRKTY TVLSPLDPKE 
LKEFANHQD. INKLASVASI- FNSFSAIGGA LDIVSDHLYF KKDNLDKLDI ADLEILKNSF - 
EQILYIKGSV AGKAKKLLLD . YKNLKTDINK . LKSYSNELVN GIKQQALEAE NLEELIVSKY 
KL 

tl7-6 .aa - 

CRPDFNIDQKDIKYPPTEKSRPKTESSKQKESKPKTEEELKKKQQEEELKKKQQEEELKKKQQEEELKKKQQEEEK 
EELRKQQLKNTLSNDLKKQIESAYNFKEKYVKSMEKEPEDHYGMTSFRGLNWGPGTEDISDNTERSIRYRRHTYTV 
LSPLDPHELKEFANIIQDINKLASVASIFNSFSAIGGALDIVSDHLYFKKDNLDKLDIADLEILKNSFEQILYIKG 
SVAGKAKKLLLDYKNLKTDINKLKSYSNELVNGIKQQALEAENLEELIVSKYKL 

fl9-2.nt 

TAAAGAAAGA TTAAATCATA TTCAAGGAGA GTATTTATGA AACACTATAT AATTGTGCAT 

ATATTTGTTT TTCTATTTTT AAATGCTTGT TATCCAGTTG C ATC T AAT AA AATAGAATTA 

AAACCTAAAA CAGAAACAAG CTTAAATCAA GAAGAAGTCC CAAATCAAGA AGCAAACTAC 

AAAGAAGAAA AAGAAGCAAA AGAAGAAGGC ATTAATAAAA AAACAGAAAA CACGCTGCTT 

AATGATTTAA GAAATTTAAT AGAAACAGCT AAAAAAGATA ATGATAAATA TACACAAAAG 

TTAAAAGAAG AATCCTCA^G CCAATACGGA ATACTGGCTT TCAAAGATTT GTTCTGGCTA 

GATGGAACAA ATGAACAATT GTCCGCAAAT ACCGAAAGAT CTAAAGCCTA • TAGAAAACGA 

GCTTATAGCA TCTTAAATAC T ATT AATGAC GCTTCCTTAA AGAATTTTTC AGAAATTGTA 

ATGGCATCAG G AC AAAC AC A GGGCATATTT . AAT ACCCTTA ACTCACTTGG GGGTAATTTT 

GAAAAGATAG TTAATTGTTT GTATCCCAAA AAAGACAATT TGGAAAAATT AGAGACTTCA 

GTTTTAAAAA AGCTTAAAGA TTCTTTGGAA AATTTTTTAG AGATAAAAAA AATCGCCTCA 

GAAATGATGC ACAAGCTCTT ATTAGACTAT CAAAATAATA CAAATCGTAT "ACAAACAGAT 

AAAAATGAAC TTAAGTCTTA TGCAGACACA CTTTTCAATC AAATGACAAA* AAAACCCGAA 

GAAGCACTAA AGCTAAAAAA TACCATATGC TCAATAGAGG ACCTTTAA 

tl9-2 .nt ^ ... 

TTGTTATCCAGTTGCATCTAATAAAATAGAATTAAAACCTAAAACAGAAACAAGCTTAAATCAAGAAGAAGTCCCA 

AATCAAGAAGCAAACTACAAAGAAGAAAAAGAAGCAAAAGAAGAAGGCATTAATAAA^^ 

TTAATGATTTAAGAAATTTAATAGAAACAGCTAAAAAAGATAATGATAAATATACACAAAAGTTAAAAGAA 

CTCAAGCCAATACGGAATACTGGCTTTCAAAGATTTGTTCTGGCTAGATGGAACAAATGAACAATTGTCCGCAAAT 

AC CGAAAGATC TAAAGC CTAT AG AAAACG AGCTT ATAGC ATCTTAAAT AC TATT AATGAC GC TTCCTT AAAGAATT 

TTTCAGAAATTGTAATGGCATCAGGACAAACACAGGGCATATTTAATACCCTTAACTCACTTGGGGGTAATTTTGA 

AAAGATAGTTAATTGTTTGTATCCCAAAAAAGACAATTTGGAAAAATTAGAGACTTCAGTTTTAAAAAAGCTTAAA 

GATTCTTTGGAAAATTTTTTAGAGATAAAAAAAATCGCCTCAGAAATGATGCACAAGCTCTTATTAGACTATCAAA 

ATAATACAAATCGTATACAAACAGATAAAJU^TGAACTTAAGTCTTATGCAGACACACTTTTCAATCAAATGA 

AAAACCCGAAGAAGCACTAAAG 
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fl9-2.aa 

RKIKSYSRRV FMKHYIIVHI FVFLFLNACY PVASNKIELK PKTETSLNQE EVPNQ EANYK 

EEKEAKEEGI NKKTENTLLN DLRNL I ETAK KDNDKYTQKL KEESSSQYGI LAFKDLFWLD 

GTNEQLSANT ERSKAYRKRA YSILNTINDA SLKWFSEIVM ASGQTQGIFN- TLNSLGGNFE 

KIVNCLYPKK DNLEKLETSV LKKLKDSLEN FLEIKKIASE MMHKLLLDYQ NNTNRIQTDK 

NELKSYADTL FNQMTKKPEE ALKLKNTICS IEDL . 

tl9-2.aa 

CYPVASNKIELKPKTETSLNQEEVPNQEAOTKEEKEAKEEGINKKTENTLLNDLRNLIETAKKDNDKYTQKLKEES 
SSQYGILAFKDLFWLDGTNEQLSANTERSKAYRKRAYSILNTINDASLKNFSEIVMASGQTQGIFNTLNSLGGNFE 
KIVTCCLYPKKDNLEKLETSVLKKLKDSL^ 



m 


KPEEALK 












m 


f 19-4-nt 












y 


TAATCTATAC 


TAATTGAGGA 


GAATATTTTT 


ATGAAAAACA 


ACATAATTTT 


ATGCATGTGT 


(21 


GTTTTTTTAC 


TTTTAAATAG 


CTGCACCGCT 


AACCATGAAG 


CTGAAGCGAA 


AATAAAAAAA 




CATGT-TGATA 


AAACAAAAAA 


CGAATATATT 


AATGAAATAA 


AAAATTTAAT 


AGCAACAACC 


y 


AAAGAAATCA 


TCGAAAAACG 


AAAATTGCTA 


CAAGCTAAAC 


CAGTAGATCA 


AAACCCCGTA 




GATGATACAA 


ACAATAAGAA 


AGTTTTCGAG 


ATAGATAAAA 


GAGCTTTCGA 


TTTTATAAAT 


S;. 


AGTTTTTTAA 


CAGATGATGA 


ATTTAATAAA 


TTTGTAACAA 


TATTTCATAA 


ACCAACACTA 


h 


AAATCACCCG 


GAAAAGTATT 


AAATAGCATA 


GCAATTCTAG 


AGCTAAACAT 


AGAGCAGGTA 


® 


ATTAATCACC 


T AGAC TC AAA 


AAAT GAG AC C 


TTAAATAAAG 


CAAGCTCTTT 


AGATTTGGAA 


m 


AAGATCAAAA 


ATTCCCTTGA 


ACAGCTGTTC 


TCTATAAGGA 


ATTTTTTTTC 


AACAATCATA 


si 


AAAAGGGTCT 


TATTAGATCA 


TCAAAACAAT 


GAAAATTCTA 


TAAAACCAGA 


. TGATTCTAAA 


m 


. TCAGGAACCT 


ATTTCGATAC 


GATATACGAT 


CAGTTTAATG 


AAAAAAATAA 


AGAGGTTAGA 




AATCTGAAAA 


AAACCATATT 


ATCACTGCCG 


AATTAA • 








t!9-4 .nt 













CTGCACCGCTAACCATGAAGCTGAAGCGAAAATAAAAAAACATGTTGATAAAACAAAAiyVCGAATATATTAATGAA 
ATAAAAAATTTAATAGCAACAACCAAAGAAATCATCGAAAAACGA 

ACCCCGTAGATGATACAAACAATAAGAAAGTTTTCGAGATAGATAAAAGAGCTTTCGATTTTATAAATAGTTTTTT 
AACAGATGATGAATTTAATAAATTTGTAACAATATTTCATAAACCAJ^CACTAAAATCACCCGGAAAAGTATTAAAT 
AGCATAGCAATTCTAGAGCTAAACATAGAGCAGGTAATTAATCACCTAGACTCAAAAAATGAGACCTTAAATAAAG 
CAAGCTCTTTAGATTTGGAAAAGATCAAAAATTCCCTTGAACAGCTGTTCTCTATAAGGAATTTTTTTTCAACAAT 
CATAAAAAGGGTCTTATTAGATCATCAAAACAATGAAAATTCTATAAAACCAGATGATTCTAAATCAGGAACCTAT 
TTCGATACGATATACGATCAGTTTAATGAAAAAAATAAAGAGGTTAGAAATCTGAAAAAA. 

fl9-4.aa ' 

SILIEENIFM KNNIILCMCV FLLLNSCTAN HEAEAKI KKH VDKTKNEYIN EIKNLIATTK 

EIIEKRKLLQ AKPVDQNPVD DTNNKKVFEI DKRAFDFINS FLTDDEFNKF VTIFHKPTLK 

SPGKVLNSIA ILELNIEQVI NHLDSKNETL NKASSLDLEK IKNSLEQLFS IRNFFSTIIK 

RVL. l dhqnne nsikpddsks GTYFDTIYDQ fneknkevrn lkktilslpn - 
tl9-4 .aa 

CTANKEAEAKI KKHVDKTKNE Y INE I KNL I ATTKE X I EKRKLLQ AK P VDQNPVDDTNMKKVFEIDKFJ^FDFINSFL 

TDDEFNKFVTIFHKPTLKSPGKVLNSIAILELNIEQVINHL^ 

IKRVLLDHQNNENSIKPDDSKSGTYFDTIYDQFNEKNKEVPjNLKX 



f 19-6 .nt 
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TAAAGGAGAG TATTAATGAA ATGCCATATA ATTGCAACTA TATTTGTTTT TCTATTTTTA 
GCTTGCAGTA- CAGATTTTAA TACTGATCAA AAAGGCATTA AATACCCGCC TACCGAAAAA 
TCAAAGCCCA AAACTGAAGA CTCTAAGCAA AAAGAATTAA AGCCTAAAAC AGAAAAAGAA 
CTAAAGAAAA- AACAACAACT AAAAAATAAA CTACTTAATG ATTTAAAAAA TTCAATAGAA 
AC AGC TAAT A AGCATAAAGA AAAGTATAAA AAAAGAATGA AAGAAGAACC CGAAGATCAA 
TACGGGGTAC AGGCTTTCAA AGGATCGAAT TGGGGGCCGG GGACTGAAGA TGTATCTGCC 
AACACCGAAA GATCTATAAG ATTTAGAAGA CATACTTATA CTATTTTAAG CACGCTGAGT 
CTTCATGAAT TAAAGGAATT CTCAAATATT GTTACAAATG AAAATAAACT GGTGCCAGTA 
GTAGATATGT TTAATTTCTT TAGCTCTATT GGGACAGCTC TTGATATAAC AACCGATAGC 
TTATATCCCA AAAAGACAAT CTGGACAAAC CAGATCTGTC GGATTTAG 

tl9-6.nt 

TTGCAGTACAGATTTTAATACTGATCAAAAAGGCATTAAATACCCGCCTACCGAAAAATCAAAGCCCAAAACTGAA 

GACTCTAAGCAAAAAGAATTAAAGCCTAAAACAGAAAAAGAACTAAAGAAAAAACAACAACTAAA 

TTAATGATTTAAAAAATTCAATAGAAACAGCTAATAAGCATAAAGAAAAGTATAAAAAAAGAATGAAAGAAGAAC 

CGAAGATCAATACGGGGTACAGGCTTTCAAAGGATCGAATTGGGGGCCGGGGACTGAAGATGTAtCTGCCAACACC 

GAAAGATCTATAAGATTTAGAAGACATACTTATACTATTTTAAGCACGCTGAGTCTTCATGAATTAAAGGAATTCT . 

CAAATATTGTTACAAATCAAAATAAACTGGTGGCAGTAGTAGATATGTTTAATTTCTTTAGCTCTATTGGGACAGC 

TCTTGATATAACAACCGATAGCTTATATCCCAAAAAGACAATCTGGACAAACCAGATCTGTCGG 

"fl9-6.aa 

. RRVLMKCHII ATIFVFLFLA CSTDFNTDQK GIKYPPTEKS KPKTEDSKQK. ELKPKTEKEL 
KKKQQLKNKL LNDLKNSIET ANKHKEKYKK RMKEEPEDQY GVQAFKGSNW GPGTEDVSAN 
TERS IRFRRH TYTILSTLSL HELKEFSNIV TNENKLVPW DMFNFFSSIG TALDITTDSL 
YPKKTIWTNQ ICRI 

tl9-6.aa 

CSTDFNTDQKGIKYPPTEKSKPKTEDSKQKELKPKTEKELKKKQQLKNKLLNDLKNSIETANKHKEKYKKRMKEEP 
EDQYGVQAFKGSNWGPGTEDVSANTERS IRFRRHTYT I LSTLSLHELKEFSNI VTNENKLVPWDMFNFFSS IGTA 
LDITTDSLYPKKTIWTNQIGR 

f21-4 .nt 

TAGGAGACAA TCTTTATGAA TAAAAAAATA AAAATGTTTA TTATTTGTGC TATTTTTATG 

CTGATAAGTT CTTGTAAGAA TGATGTAACT AGTAAAGATT TAGAAGGGGC GGTGAAAGAT 

TTAGAAAGTT CAGAACAAAA TGTAAAAAAA ACAGAACAAG AGATAAAAAA AGAAGTTGAA 

GGATTTTTAG AAATTTTAGA GACAAAAGAT TTAAACACAT TAGATACAAA AGAAATTGAA ^ 

AAACAAATTC AAGAATTAAA GAATAAGATA GAAAAATTAG ACTCTAAAAA AACTTCTATT 

GAAACATATT CTGGGTATGA AGAAAAAATA AACAAAATAA AAGAAAAATT AAACGGAAAA 

GGACTTGAAG ATAAATTAAA TGAACTTTCA GAG AGC TT AA AAAAGAAAAA AGAGGAGAGA 

AAAAAAGCTT TACAAGAGGC TAAAAAGAAA TTTGAAGAGT ATAAAAACCA AGCTGAATCT 

GCAACTGGAG TAACGCATGG TTCTCAAGTC CAAAGACAAG GTGGTGTTGG ATTACAAGCT 

TGGC AGTGTG CTAATAGTTT GGGGTTTAAA AATATGACTA GTGGTAATAA- TAG TAGCG AT 

ATGACCAATG AAGTTATAAC TAATTCGCTT AAAAAGATTG AAGAAGAACT TAAAAATATT 

GGAGAAACTG TAGAAGGTAA AAAAGAATAA 

t21-4.nt 

TTGTAAGAATGATGTAACTAGTAAAGATTTAGAAGGGGCGGTGAAAGATTTAGAAAGTTCAGAACAAAATGTAAAA 
AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGAAATTTTAGAGACAAAAGA.^ 

AT AC AAAAG AAATTGAAAAAG AAATTC AAG AATT AAAGAATAAG AT AGAAAAATT AGAC TC T AAAAAAACTTC TAT 
TGAAAC AT ATTC TGGGT ATGAAG AAAAAAT AAAC AAAAT AAAAG AAAAATTAAAC GGAAAAGGAC TTGAAGAT AAA 
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TTAAATGAACTTTC AGAGAGC TT AAAAAAGAAAAAAGAGGAGAG AAAAAAAGC TTT AC AAG AG&C TAAAAAGAAAT 
TTGAAGAGTATAAAAACCAAGCTGAATCTGCAACTGGAGTAACGCATGGTTCTCAAGTCCAAAGACAAGGTGGTGT 
TGGATTACAAGCTTGGCAGTGTGCTAATAGTTTGGGGTTTAAAAATATGACTAGTGGTAATAATACTAGCGATATG 
ACCAATGAAGTTATAACTAATTCGCTTAAAAAGATTGAAGAAGAACTTAAAAATATTGGAGAAACTGTAGAAGGTA 

AAAAAGAA 

f21-4.aa . ' 

ETIFMNKKIK MFIICAIFML ISSCKNDVTS KDLEGAVKDL ESSEQNVKKT EQEIKKQVEG 
FLEILETKDL NTLDTKEIEK QIQELKNKIE KLDSKKTSIE TYSGYEEXIN KIKEKLNGKG * 
LEDKLNELSE SLKKKKEERK KALQEAKKKF EEYKNQAESA TGVTHGSQVQ RQGGVGLQAW 
QCANSLGFKN MTSGNNTSDM TNEVITNSLK' KIEEELKNIG ETVEGKKE 

t21-4.aa _ 

CKNDVTSKDLEGAVKDLESSEQNVKKTEQEIKKQVEGFLEILETKDLNTLDTKEIEKQIQELKNKIEKLDSKKTSI 
' ETYSGYEEKINKIKEKLNGKGLEDKLNELSESLKKKKEERKKALQEAKKKFEEYKNQAESATGVTHGSQVQRQGGV 
GLQAWQCANSLGFKNMTSGNNTSDMTNEVITNSLKKIEEELfCNIGETVEGKKE 

- f 24-1 .nt ' 

TAAGCTGGTA ACACTGTAAA GACAGCTGAG GGGGCTTCAA GTGGTACTGA TGCAATTGGA 
GAAGTTGTGG ATAATGATGC TAAGGTTGCT " GATAAGGCGA GTGTGAC GGG GATTGCTAAG 
GGGATAAAGG AGATTGTTGA AGCTGCTAGG GGGAGTGAAA AGCTGAAAGT TGCTGCTGCT 
AAAG AGGGC A ATGAAAAGGC , AGGGAAGTTG TTTGGGAAGG CTGGTGCTAA TGCTCATGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA ' 
TTAAGTGCGA TTGTTAAGGC TGCGGATGCG GCTGAGCAGG ATGGAAAGAA GCCTGCAGAT 
GCTACAAATC CGATTGCTGC TGCTATTGGG' AATAAAGATG AGGATGCGGA TTTTGGTGAT - 
GGGATGAAGA AGGATGATCA GATTGCTGCT GCTATTGCTT ■ TGAGGGGGAT GGCTAAGGAT 
GGAAAGTTTG CTGTGAAGAA TGATGAGAAA GGGAAGGCTG AGGGGGCTAT TAAGGGAGCT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GGTGCTGCGA AGGCTGCTGA TAAGGATAGT 
GTGAAGGGGA TTGCTAAGGG GATAAAGGAG ATTGTTGAAG CTGCTGGGGG - GAGTGAAAAG 
CTGAAAGCTG CTGCTGCTGA AGGGGAGAAT AATAAAAAGG CAGGGAAGTT GTTTGGGAAA 
GTTGATGGTG CTGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGAGCAGGAT 
GGAGAGAAGC CTGAGGATGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA - GGGT AATGGG 
GATGGTGCGG AGTTTGATCA GGATGAGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
GCTTTGAGGG GGATGGCTAA GGATGGAAAG . TTTGCTGTGA AGGGTAATAA TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGAAGTTAGC GAGTTGTTGG ATAAGCTGGT AACAGCTGTA 
AAGACAGCTG AGGGGGC TTC AAGTGGTACT GATGCAATTG GAGAAGTTGT GGATAATGNT 
GCNAAGGNTG CTGATAAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTN GGGGGAGTGA AAAGC TG AAA GTTGCTGCTG C T AN AGNGGN NAATAATAAA 
GAGGCAGGGA AGTTGTTTGG GAAGGCTGGT GCTGATGCTA ATGGGGACAG TGAGGCTGCT 
XGCAAGGCGG CTGGTGCTGT TAGTGCTGTT AGTGGGGAGC AGATATTAAG TGCGATTGTT- 
AAGGCTGCGG CTGCTGGTGC GGCTGATCAG GATGGAGAGA AGCCTGGGGA TGCTAAAAAT 
CCGATTGCTG CTGCTATTGG GAAGGGTAAT GCGGATGATG GTGCGGATTT TGGTGATGGG 
ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG " TGAAGAAGGA r TGAGAAAGGG AAGGCTGAGG GGGCTATTAA GGGAGCTAGC 
GAGTTGTTGG ATAAGCTGGT AAAAGC TGT A AAGACAGCTG AGGGGGCTTC AAGTGGTACT 
GCTGCAATTG GAGAAGTTGT GGATAATGCT GCGAAGGCTG CTGATAAGGA TAGTGTGACG 
GGGATTGCTA AGGGGATAAA GGAGATTGTT GAAGCTGCAG GGGGGAGTGA AAAGC TG AAA 
GTTGCTGCTG C T AAAGGGG A G AAT AAT AAA GGGGCAGGGA AGTTGTTTGG GAAGGCTGGT 
GCTAATGCTC ATGGGGACAG TGAGGCTGCT AGCAAGGCGG CTGGTGCTGT TAGTGCTGTT 
AGTGGGGAAC AGATATTAAG TGCGATTGTT AAGGCTGCTG GTGAGGCTGC TGGTGATCAG 
GAGGGAAAGA AGC CTGAGGA GGCTAAAAAT CCGATTGCTG CTGCTATTGG GGATAAAGAT 
GGGGATGCGG AGTTTAATCA GGATGGGATG AAGAAGGATG ATCAGATTGC TGCTGCTATT 
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GCTTTGAGGG GGATGGCTAA GGATGGAAAG TTTGCTGTGA AGGATGGTGG TGAGAAAGAG 
AAGGCTGAGG GGGCTATTAA AGGAGTTAGC GAGTTGTTGG ATAAGCTGGT- AAAAGCTGTA 
AAGACAGCTG AGGGGGC TTC AAGTGGT AC T GCTGCAATTG GAGAAGTTGT GGCTGATGCT- 
GCTAAGGTTG C TG AT AAGGC GAGTGTGACG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGACAGTGA GGCTGCTAGC AAGGC AGCTG GTGCTGTTAG TGCTGTTAGT 
GGGGAGCAGA TATTAAGTGC GATTGTTAAG GCTGCGGCTG CTGGTGCGGC TGAGCAGGAT 
GGAGAGAAGC CTGCAGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTGATGGG 
GATGCGGATT TTGGTGAGGA TGGGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGCTGCAATT GGAGAAGTTG TGGATAATGC TGGTGCTGCG 
AAGGC TGCTG ATAAGGATAG TGTGAAGGGG ATTGCTAAGG GGATAAAGGA GATTGTTGAA 
GCTGCTGGGG GGAGTGAAAA GCTGAAAGCT GCTGCTGCTG AAGGGGAGAA TAATAAAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGGT GCTGCTGGGG ACAGTGAGGC TGCTAGCAAG 
GCGGCTGGTG CTGTTAGTGC TGTTAGTGGG GAGCAGATAT TAAGTGCGAT TGTTAAGGCT 
GCGGATGCGG CTGAGCAGGA TGGAAAGAAG CCTGCAGATG CTACAAATCC GATTGCTGCT 
GCTATTGGGA ATAAAGATGA GGATGCGGAT TTTGGTGATG GGATGAAGAA GGATGATCAG 
ATTGCTGCTG CTATTGCTTT GAGGGGGATG GCTAAGGATG GAAAGTTTGC TGTGAAGGGT 
AATAATGAGA AAGGG AAGGC TG AGGGGGC T TC AAGTGGT A ' CTGATGCAAT TGGAGAAGTT 
GTGGATAATG ATGCGAAGGC TGCTGATAAG GCGAGTGTGA CGGGGATTGC TAAGGGGATA 
AAGGAGATTG TTGAAGCTGC TGGGGGGAGT GAAAAGCTGA AAGCTGTTGC TGCTGCTACA 
AGGGAGAATA ATAAAGAGGC AGGGAAGTTG TTTGGGAAAG TTGATGATGC TCATGCTGGG 
GACAGTGAGG CTGCTAGCAA GGCGGCTGGT GCTGTTAGTG CTGTTAGTGG GGAGCAGATA 
TTAAGTGCGA TTGTTACGGC TGCGGCTGCT GGTGAGCAGG ATGGAGAGAA GCCTGCAGAG 
GCTACAAATC CGATTGCTGC . TGCTATTGGG AAGGGTAATG AGGATGGTGC GGATTTTGGT 
AAGGATGAGA TGAAGAAGGA TGATCAGATT GCTGCTGCTA TTGCTTTGAG . GGGGATGGCT 
AAGGATGGAA AGTTTGCTGT GAAGAGTAAT GATGGTGAGA AAGGGAAGGC TG AGGGGGC T 
ATTAAGGAAG TTAGCGAGTT GTTGGATAAG CTGGTAAAAG CTGTAAAGAC AGCTGAGGGG 
GCTTCAAGCG GTACTGATGC AATTGGAGAA GTTGTGGCTA ATGC TGGTGC TGCGAAGGCT 
GCTGATAAGG CGAGTGTGAC GGGGATTGCT AAGGGGATAA AGGAGATTGT TGAAGCTGCT 
GGGGGGAGTA AAAAG C TG AA AGCTGCTGCT GCTGAAGGGG AGAATAATAA AAAGGCAGGG 
AAGTTGTTTG GG AAGGC TGG TGCTGGTGCT GGTGCTAATG GGGACAGTGA GGCTGCTAGC 
AAGGC GGCTG GTGCTGTTAG TGC TGGTT AG 

t24-l.nt * ' 

TGGTGAGGCTGAGCAGGATGGAGAGAAGCCTGAGGATGCTAAAAATCCGATTGCTGCTGCTATTGGGAAGGGTAAT 
GGGGATGGTGCGGAGTTTGATC AGGATGAGATGAAGAAGGATG ATC AGATTGC TGC TGC TAT TGC T TTGAGGGGGA 
TGGCTAAGGATGGAAAGTTTGCTGTGAAGGGTAATAATGAGAAAGAGAAGGCTGAGGGGGCTATTAAAGAAGTTAG 
CGAGTTGTTGGATAAGCTGGTAACAGCTGTAAAGACAGCTGAGGGGGCTTCAAGTGGTACTGATGCAATTGGAGAA 
GTTGTGGATAATGNTGCNAAGGNTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGGATAAAGGAGATTGTTG' 
AAGCTGCTNGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTANAGNGGNNAATAATAAAGAGGCAGGGAAGTTGTT 
TGGGAAGGCTGGTGCTGATGCTAATGGGG ACAGTGAGGC TGCTAGCAAG . 

f24-l.aa 

AGNTVKTAEG ASSGTDAIGE WDNDAKVAD KASVTGIAKG IKEIVEAARG SEKLKVAAAK 
EGNEKAGKLF GKAGANAHGD SEAA3KAAGA VSAVSGEQIL SAIVKAADAA EQDGKK P ADA 
TNP I AAAIGN KDEDADFGDG MKKDDQ I AAA IALRGMAKDG KFAVKNDEKG KAEGAI KGAA 
AIGEWDNAG AAKAADKDSV KGIAKGIKEI VEAAGGSEKL KAAAAEGENN KKAGKLFGKV 
DGAAGDS EAA SKAAGAVSAV SGEQIL3AIV KAAGEAEQDG EKPEDAKNPI AAA I GKGNGD 
GAEFDQDEMK KDDQIAAAIA LRGMAKDGKF AVKGNNEKEK AEGAIKEVSE LLDKLVTAVK 
'TAEGASSGTD AIGEWDNXA KXADKASVTG IAXGIKEIVE 'AAXGSEKLKV AAAXXXNNKE 
AGKLFGKAGA DANGDSEAAS- KAAGAVSAVS GEQILSAIVK AAAAGAADQD GEKPGDAKi^? 
IAAAIGKGNA DDGADFGDGM KKDDQ I AAA I ALRGMAKDGK" FAVKKDEKGK AEG A I KG AS E 
LLDKLVKAVK TAEGA5 SGTA AIGEWDNAA KAADKDSVTG IAXGIKEIVE AAGGSEKLKV 
AAAKGENNKG AGKLFGKAGA NAHGDSEAAS KAAGAVSAVS GEQILSAIVK AAGEAAGDQE 
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GKKPEEAKNP I AAA I GDKDG DAEFNQDGMK KDDQIAAAIA LRGMAKDGKF AVKDGGEKEK r 

AEGAIKGVSE LLDKLVKAVK TAEGASSGTA A I G E WAD AA KVADKASVTG IAKGIKEIVE 

AAGDSEAASK AAGAVSAVSG EQILSAIVKA AAAGAAEQDG EKPAEAKNPI AAA IGKGDGD 

ADFGEDGMKK DDQIAAAIAL RGMAKDGK F A VKND EKGKAE GAIKGAAAIG EWDNAGAAK 

AADKDS VKG I AKGIKEIVEA AGGSEKLKAA AAEGENNKKA GKL FGKVDGA AGDSEAASKA 

AGAVSAVSGE QILSAIVKAA DAAEQDGKKP ADATNP I AAA IGNKDEDADF GDGMKKDDQI 

AAA I ALRGMA KDGKFAVKGN NEKGKAEGAS SGTDAIGEW DNDAKAADKA S VTG I AKG I K . 

EIVEAAGGSE KLKAVAAATR ENNKEAGKLF GKVDDAHA r GD SEAASKAAGA VSAVSGEQIL 

S A I VTAAAAG EQDGEKPAEA TNPIAAAIGK GNEDGADFGK DEMKKDDQIA AAIALRGMAK 

DGKFAVKSND GEKGKAEGAI KEVSELLDKL VKAVKTAEGA SSGTDAIGEV VANAGAAKAA 

DKASVTGIAK GIKEIVEAAG GSKKLKAAAA EGENNKKAGK LFGKAGAGAG ANGDSEAASK 
AAGAVSAG ' * 

t24-l .aa - 

GEAEQDGEKPEDAKNPIAAAIGKGNGDGAEFDQDEMKKDDQ 
ELLDKLVTAVKTAEGASSGTDAIGEVVDNXAKXADKASVT^ 
GKAGAD ANGDSEAASK 

• f2 8-2 .nt 

TAAAAAGGAA ATATAAATAT TATGCGATTA TGTTTAATAA AAATTTTTAT TATACCTAAT 

TTAGTATTTA GTTCTCTTTT TTTATTTGAA AGTTGTTCTG GTTTTCTATC TAAAAAATCT 

ATAGAACAGT TTGCATTAGC ATTAAAAGAT CATCAAGAAA ATAAAAATAC TACTAATACT 

TCAGTAGATA AAAATAGTAA GGAAATTGAA TCTCCTAAAG . ACGTTACATC . ATC AAATAAA 

AAAACTTATG ATCCAATCTT ACAAGTAGGT TCTAATCAAC ATATGTCAGA TGATCCTGGT 

GCTAATAATA AAGAATCCCT ACCAAATTCA - AGTCCAGCAA TAATACAAAA TGACTCGCAT 

GCTCAAAATA. ATGTAAAGAT GGAAGAAAAT AAATCAGCTA CTCCACAACA TGATCCAATT 

GAACAAAGTA ATTTTAAAAA TAGCCTTACT ACAACAAGTA AAACTCCTGC TATTCCTTCA 

GAAGAAGAAA TTAAAGCTAA CTTAGATGAA TTTGCACAAG AAGAGTATGA GCAAACATCT 

CTTTCAGAAA TTAAAAATGC CACGCAAATT GTTAATCATG CTAATCCTGA AAAC AAATT A 

AACAATACAC TCCTTGAGTT TGAAAAAGAT TATGAAACTT T ATC AAAC TT GTTATTCTCT 

AATTTAGACG CATCTCCTTT GAATAGAAAA ATAAAGACTA TTATGCCTAA ATTACAAGAA 

ATGCGTTCTT TTATGGAGCA AGCAACTAAT TCTTGGGTAT CTGCTAAAGG CATGCTAGAT 

GAGGCTAAGG . ATAAACTAGC AGAATCTATT TATAAAAGAC TATACAATGG CAATTCATAC 

CGGTTCGGTG GCAGTTTTAA CGGACGTGAT ATGCAACATG CAAAAAATTT AGCATACAGA 

GC TAT AG AC T TTGCTTCTGC ATGCATTGAA TATACACAAA AAGCTATTGA TTATCTTCAA 

CAGGGAAATT CTTGCAAAAA AGAAATAGAA AATATATTCA AGCTTTAA 

t28-2.nt 

AAAAGATCATCAAGAAAATAAAAATACTACTAATACTTCAGTAGATAAAAATAGTAAGGAAATTGAATCTCC^ 

GACGTTACATCATCAAATAAAAAAACTTATGATCCAATCTTACAAGTAGGTTCTAATCAACATATGTCAGATGATC 

CTGGTGCTAATAATAAAGAATCCCTACCAAATTCAAGTCCAGCAATAATACAAAATGACTCGCATGCTCAAAATAA 

TGTAAAGATGGAAGAAAATAAATCAGCTACTCCACAACATGATCCAATTGAACAAAGTAATTTTAAAAATAGC 

ACTACAACAAGTAAAACTCCTGCTATTCCTTCAGAAGAAGAAATTAAAGCTAACTTAGATGAATTTGCACAAGAAG 

AGTATGAGCAAACATCTCTTTCAGAAATTAAAAATGCCACGCAAATTGTTAATCATGCTAATCCTGAAAACAAATT 

AAACAATACACTCCTTGAGTTTGAAAAAGATTATGAAACTTTATCAAACTTGTTATTCTCTAATTTAGACGCATCT 

C C TTTGAAT AGAAAAATAAAG AC T ATT ATGC CT AAATT AC AAGAAATGCGTTC TTTTATGGAGCAAGCAAC T AATT 

C TTGGGTATC TGCT AAAGGC ATGC T AGATG AGGC T AAGGATAAAC TAGC AGAATCT ATTT ATAAAAGACT AT AC AA 

TGGCAATTCATACCGGTTCGGTGGCAGTTTTAACGGACGTGATATGCAACATGCAAAAAATTTAGCATACAGAGCT 

ATAGACTTTGCTTCTGCATGCATTGAATATACACAAAAAGCTATTGATTATCTTCAACAGGGAAATTCTTGCAAAA 

AAGAAATAGAAAATATATTCAAG 



f23-2 .aa 
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KGNINIMRLC LIKIFIIPNL VFSSLFLFES CSGFLSKKSI EQFALALKDH QENKNTTNTS 
VDKNSKEIES PKDVTSSNKK TYDPILQVGS NQHMSDDPGA " NNKESLPNSS PAIIQNDSHA 
QNNVKMEENK SATPQHDPIE QSNFKNSLTT TSKTPAIPSE EEIKANLDEF AQEEYEQTSL 
SEIKNATQIV NHANPENKLN NTLLEFEKDY ETLSNLLFSN LDASPLNRXI KTIMPKLQEM 
RSFMEQATNS WVSAKGMLDE AKDKLAESIY KRLYNGNSYR FGGSFNGRDM QHAKNLAYRA 
IDFASACIEY TQKAIDYLQQ GNSCKKEIEN IFKL 

t28-2.aa . ■ - 

KDHQENKNTTNTSVDKNSKEIESPKDVTSSNKKTYDPILQVGSNQHMSDDPGANNKESLPNSSPAIIQNDSHAQNN 
VKMEENKSATPQHDPIEQSNFKNSLTTTSKTPAIPSEEEIKANLDEFAQEEYEQTSLSEIKNATQIVNHANPENKL 
NNTLLEFEKDYETLSNLLFSNLDASPLNRKIKTIMPKLQEMRSFMEQATNSWVSAKGMLDEAKDKLAESIYKRLYN 
GNSYRFGGSFNGRDMQHAKNLAYRAIDFASACIEYTQKAIDYLQQGNSCKKEIENIFK - 

£23-3. nt • - 

TAGATGAATT TAATTGCTAA ATTATTTATT TTATCCACTT TAGTTTCAAT TCCAAATATC 

CTCTCTTGTA AC C T AT ATG A TAATCTTGCA GACAACGCTG AGCAGGTTAC AGACATACTA 

GACAACAACA AGTCTTTTAA TACTTTAGGA AGCAGCAATG AGAGTAGAAG TCGCAGGCCT 

AGAAGTACAA ATAATGCTTA TATGAAACAA -AACATAGACA AAAATCATTT AGTTGTTGCA 

GATATGCAAA ATGATAATAG TAGCAGCAGT CTTCCCCAAC AAGTTAATAG TGAATCCAGT 

AAAGCTAATG AAGATAGTAA TATTATGAAG GAAATTGAAT CTTCTACAGA AGAGTGCGCT 

AGACTAAGAA AAGATTTAGA *AACTATAAAA CAAATACTTG ATAATATAGA AAGCTTGCTT 

AATACAGCTA ATTCTTATTT AGAGAACGCT AGAAAAGCAC CTAAATCTAA TCAAGATAAT 

CAAACCTTAT TGCTTAGCCT GCACCAAGCT ATTGCTAAGG TTAAGAGTAG TCATACTTCT 

TTTATCATTT GTTATAATGA TGCATTTAAT TCCCTGGGAA . TAGCTGATAC TGCCTTTAAA 

GATGCAAAGA GAAAGGCAGT TGAGGCATAA 

t28-3.nt 

TTGTAACCTATATGATAATCTTGCAGACAACGCTGAGCAGGTTACAGACATACTAGACAACAAGAAGTCTTTTAAT 
ACTTTAGGAAGCAGCAATGAGAGTAGAAGTCGCAGGCCTAGAAGTACAAATAATGCTTATATGAAACAAAACATAG, 
"ACAAAAATCATTTAGTTGTTGCAGATATGCAAAATGATAATAG.TAGCAGCAGTCTTCCCCAACAAGTTAATAGTGA 
ATC C AGT AAAGC T AATGAAG AT AGT AAT ATT ATG AAGG AAATTG AATC TTCT AC AGAAGAGTGC GC T AGAC T AAG A 
AAAGATTTAGAAACTATAAAACAAA'TACTTGATAATATAGAAAGCTTGCTTAATACAGCTAATTCTTATTTAGAGA 
ACGCTAGAAAAGCACCTAAATCTAATCAAGATAATCAAACCTTATTGCTTAGCCTGCACCAAGCTATTGCTAAGGT 
TAAGAGTAGTCATACTTCTTTTATCATTTGTTATAATGATGCATTTAATTCJCCTGGGAATAGCTGATACTGCCTTT 
AAAGATGCAAAGAGAAAGGCAGTTGAGGCA 

"f28-3.a"a 

MNLIAKLFIL STLVSIPNIL SCNLYDNLAD NAEQVTDILD NNKSFNTLGS SNESRSRRPR 
STNNAYMKQN I DKNHLWAD MQNDNSSSSL PQQVNSESSK ANEDSNIMKE IESSTEECAR 
LRKDLETIKQ ILDNIESLLN TANSYLENAR KAPKSNQDNQ TLLLSLHQAI AKVKSSHTSF 
IICYNDAFNS LGIADTAFKD AKRKAVEA 

t28-3.aa 1 * 

CNLYDNLADNAEQVTDILDNNKSFNTLGSSNESR 

SSKANEDSNIMKEIESSTESCARLRKDLETIKQILDNIESLLNTANSYLENARKAPKSNQDNQTLLLSLHQAIAKV 
KSSHTSFIICYNDAFNSLGIADTAFKDAKRKAVEA 

f31-2.nt ■ 

TAAAAAAATA AGGAGGTATT AATGAAAAGG AAAAGCAATA TATGTATTTC ACTTCTAGTC 
ACAATATTAT TTGTGTCTTG CAAGTTTTTT GGAAATAAAA GCGCAAGTAA AGAAAAAGAA 
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GAAACTTCTT TTTCTGATAC TGCTAGCAAG ATTAGTAAGT CGGGAACAGC TGCTTCTTCA 
GACAAACAAG AAAAAAATAC AAGTGATGTT ACAGGTGACG CCAAAAAGCA TACTAGTAGC 
CCTTACATGC TTGCTGATGC CCTTATTGTT AGTG AT AC T A CTAATAGAGA TAGAGATAAG 
CAAGAAAATA AAGATAAATT AAATGAAGAA GATAAAAAAA AGCTTAATGC TTTTTTTAGC 
ACAACTAAAA CATATCAATC TAGC C TAG AT TCCATTTATA AC AAAT AT AC AGGCTATTAT 
AATACCATTG AT AC CT AT GG CAGCTGTGAT ACGTATCGCA TTGAGTGTTT TAGTGTAGGA 
CCTTCTGAAA AACGTAAACA AGCTCTTGCT GATCTAGAGA AGTTAAAACT AGACGAAAAG 
TACACTOAGC TTAGCACAAT GTTAAAGAGT GCTGTGCCTA GTTATTACAA AAAAAATTTA 
GATGATTCTA TTGCACAGTA TAAGGAAGCC ATAAAGCAGG CTATTGAAGC TGAAAGTAAA 
ATAGAGACAG T AAAAG AC T A TGCAACAGCT CAAAGTGCTG CCGATGACGA AAAGAAAAGA 
AATATAGATA ATTTAAAAAT AGTTAGAGAT GTTCTTCTTA TTATTAAAAA AACTATTGAG 
AAAGCCAGCC GATCTTATGC TGATGCTTTT GCTATTGCAA CATCTAGCTT ATCTTGTAGC 
GAATTTAAGC AAGCTGTTAA AGAGTTTAAT ' GATGCTGC T A. AACAATATGC TAATGGAAAT 
AAAGGAGACA ATGCTGTCAA TGTTATTGTA GGCACTATTT CTAGTATGCC TTATGTCAAA " 
TTTAAAGATG AGTTTGCAAG AGCAAAAATG TTTGCTCGTA ATTATAGAGG AGACGAGGTA 
GACAAGATGA TAAGAGCTAT CGACAAGCTG TGTGATGTTT ATAAAAAAGT TGCGCTTTAG 

t31-2 .nt • • ; 

TTGCAAGTTTTTTGGAAATAAAAGCGCAAGTAAAGAAAAAGAAGAAACTTCTTTTTCTGATACTGCTAGCAAGATT 
AGTAAGTCGGGAACAGCTGCTTCTTCAGACAAACAAGAAAAAAATACAAGTGATGTTACAGGTGACGCCAAAAAGC 
ATACTAGTAGCCCTTACATGCTTGCTGATGCCCTTATTGTTAGTGATACTACTAATAGAGATAGAGATAAGCAAGA 
AAATAAAGATAAATTAAATGAAGAAGATAAAAAAAAGCTTAATGCTTTTTTTAGCACAACTAAAACATATCAATCT 
AGCCTAGATTCCATTTATAACAAATATACAGGCTATTATAATACCATTGATACCTATGGCAGCTGTGATACGTATC 
GCATTGAGTGTTTTAGTGTAGGACCTTCTGAAAAACGTAAACAAGCTCTTGCTGATCTAGAGAAGTTAAAACTAGA 
CGAAAAGTACACTCAGCTTAGCACAATGTTAAAGAGTGCTGTGCCTAGTTATTACAAAAAAAATTTAGATGATTCT 
ATTGCACAGTATAAGGAAGCCATAAAGCAGGCTATTGAAGCTGAAAGTAAAATAGAGACAGTAAAAGACTATGCAA 
CAGCTCAAAGTGCTGCCGATGACGAAAAGAAAAGAAATATAGATAATTTAAAAATAGTTAGAGATGTTCTTCTTAT 
TATTAAAAAAACTATTGAGAAAGCCAGCCGATCTTATGCTGATGCTTTTGCTATTGCAACATCTAGCTTATCTTGT 
^AGCGAATTTAAGCAAGCTGTTAAAGAGTTTAATGATGCTGCTAAACAATATGCTAATGGAAATAAAGGAGACAATG 
CTGTCAATGTTATTGTAGGCACTATTTCTAGTATGCCTTATGTCAAATTTAAAGATGAGTTTGCAAGAGCAAAAAT 
GTTTGCTCGTAATTATAGAGGAGACGAGGTAGACAAGATGATAAGAGCTATCGACAAG • 

f 31-2 .aa '\ ' , . 

KNKEVLMKRK SNICISLLVT ILFVSCKFFG NKSASKEKEE TSFSDTASKI . SKSGTAASSD- 

KQEKNTSDVT GDAKKHTSSP YMLADALIVS DTTNRDRDKQ ENKDKLNEED KKKLNAFFST 

TKTYQSSLDS IYNKYTGYYN TIDTYGSCDT YRIECFSVGP SEKRKQALAD LEKLKLDEKY 

TQLSTMLKSA VPSYYKKNLD DSIAQYKEAI KQAIEAESKI ETVKDYATAQ SAADDEKKRN 

IDNLKIVRDV LLIIKKTXEK ASRSYADAFA IATSSLSCSE FKQAVKEFND 'AAKQYANGNK 

GDNAVNVXVG TISSMPYVKF KDEFARAKMF ARNYRGDEVD KMIRAXDKLC DVYKKVAL 

t31-2..aa 

CKFFGNKSASKEKEETSFSDTASKISKSGTAASSDKQEKNTSDVTGDAKKHTSSPYMLAIDALiySDTTNRDRDKQE 
NKDKLNEEDKKKLNAFFSTTKTYQSSLDSIYNKYTGYYNTIDTYGSCDTYRXECFSVGPSEKRKQALADLEKLKLD 
EKYTQLSTMLKSAVPSYYKKNLDDSIAQYKEAIKQAIEAESKIETVKDYATAQSAADDEKKRNIDNLKIVRDVLLI 
IKKTIEKASRSYADAFAIATSSLSCSEFKQAVKEFNDAAKQYANGNKGDNAVNVIVGTISSMPWKFKDEFARi^ 

F ARNYRGD E VD KM I RA I DK 



f 32-4 .nt 

TAAGGAAATA TGAGGAATAT 
GGATTATTAA TTTTTTGTTG 
AACTTTAAAG AAGAGCGGAA 
TGTTATTTTA AAGAATTTAA 



TAGCAATTGT ATCAAATATA 
TGCAACCTTT GTTTGGTTGA 
TTATTCAATA AGCCCAATAG 
GTCTGGACTT ATT AAAAG C G 



TTATATTAAC AATGCTTATT 

TTGGAATTTT TTATTCAAAT 

ATAGTGTTAT TATGCGTAAA 

TATTCTTTAA GAAATTAGAT 
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GTAAATGTTA ACTCTAAAAA TTTTAAGGAG CTAAATAAGG TAGATAAACA AAATCTGCTA 

AATTCTTATC CATCTTATCA TATGGAGTTT GTCGTAGTTG ATAATGGATT TTTAATGAAT 

TTTAAAAATG TTATTTTTAA TGGTATAGAT GATGCTAAAT TATACGATCA ACGTGATATG 

GTTTACGGAG GATTTAGATa' CTCAAAAGAG GCTTATTTCC AAATTATTGG CAATTATGAT 

GTTAAATTAA ATAAAATGAA AC AATAT AC T CCAGCAATTG TAGTAAATGT TTTCAAAATT 

AACATTAATG ATGCTTTATT TAACTCGTTA TTAAAGCAAA AAACTTTAAA AGTTACTTTG 

-ATTTCCCATA ATAATAAAGA GTATATTTTA CAAACTAATA ATTTCTTATC AAAGTATAAT 
TTTCAAACAC CAGAAAAGGA GAATAGTTCT TACTAA 

t32-4.nt 

AAATAACTTTAAAGAAGAGCGGAATTATTCAATAAGCCCAATAGATAGTGTTATTATGCGTAAATGTTATTTTAAA 

GAATTTAAGTCTGGACTTATTAAAAGCGTATTCTTTAAGAAATTAGATGTAAATGTTAACTCTAAAAATTTTAAGG 

AGCTAAATAAGGTAGATAAACAAAATCTGCTAAATTCTTATCCATCTTATCATATGGAGTTTGTCGTAGTTGATAA 

TGGATTTTTAATGAATTTTAAAAATGTTATTTTTAATGGTATAGATGATGCTAAATTATACGATCAACGTGATATG ■ 

GTTTACGGAGGATTTAGATACTCAAAAGAGGCTTATTTCCAAATTATTGGCAATTATGATGTTAAATTAAATAAAA 

TGAAACAATATACTCCAGCAATTGTAGTAAATGTTTTCAAAATTAACATTAATGATGCTTTATTTAACTCGTTATT 

AAAGCAAAAAACTTTAAAAGTTACTTTGATTTCCCATAATAATAAAGAGTATATTTTACAAACTAATAATTTCTTA 

TCAAAGTATAATTTTCAAACACCAGAAAAGGAGAATAGTTCTTAC 

f32-4 .aa „ . 

GNMRWISNCI KYIILTMLIG LLIFCCATFV WLIGIFYSNN FKEERNYSIS • PIDSVIMRKC 

YFKEFKSGLl" KSVFFKKLDV NVNSKNFKEL NKVDKQNLLN SYPSYHMEFV WDNGFLMNF 

KNVIFNGIDD AKLYDQRDMV YGGFRYSKEA YFQI IGNYDV KLNKMKQYTP AIWNVFKIN 

INDALFNSLL KQKTLKVTLI SHNNKEYILQ TNNFLSKYNF QTPEKENSSY 

t32-4.aa 

NNFKEERNYSISPIDSVIMRKCYFKEFKSGLIKSVFFKKL^ 
GFLMNFKWIFNGIDDAKLYDQRDMV^^ 
■ KQKTLKVTLI SHNNKEYILQTNNFLSKYNFQTPEKENSSY 

f4-15.nt 

TAAATGAGCA AAAAAGTAAT TTTAATATTA CTAGAAATTT TGATCTTGTC . TTGTGATTTA 
TCTATAAATA AAGAAC AAAA AACCAAAGAA AAAACATCTG AAAAGCAAGA ATCTGAAAAA 
CAAAATATTG AAAAACAAGA GCC TGAAAAA CAGAAACAAA ATGCAGCAAA AATAATCCCT 
ACGGTATCAA TTCAAACGGT AGAAATAAGG GAATCAAATC AAATTCCAAA AAGCATTGAG 
AAGTACTACA AGCAAGCTTA TCCGATTCAA ACATTCACTC TTGATTTTAG CATCACAAGA 
GAAAAGGAAT TTCTAAAACC AGAAGATAAA ATCTTGCCCA CACAGGGGAA AGTGGAGTCT 
TTGAGCATCT TAATAAATAA AAAATTGTTA GACTTTAAAG - CCCCAGAAAA TC C AAAAAGC 
TCAACTTTAA AAAATTTCAA AGAAATTAAA AATATTGAGA ATTTGTTCCA AAATCAAGAC 
TTATTATTTG TCTTAACCCT TAAAGATAAA AATAACAACA ACACTATTAA CATCATGCTC 
AATCCCCCAA ACGACATCCA AAAACCCAAA GATTATATTT TAAAAGACCT TAAAGACACA 
ATTAAAAAGG GT AC TGGTG A GAAATACTTA AATCCTATCT ATAGATTTCA AATAAAAAAC 
AAAAAAGATT ATCATTCAAT AGATTACAAC AAAGTGACTA TTAGCGAAAA AACAATAGAA 
TTGGACCTAC TGCCTCACGA ACAAGTCTTT CAAATGAATA AAAATTTCAC TAAAATTTTA 
GACACAATAA CAGACTTAAA TAATCTAAAA TTAGTAATTC AAAAAGAATT AGTGTAA 

' t4-15 .nt , 
TTGTGATTTATCTATAAATAAAGAACAAAAAACCAAAGAAAAA 

ATTGAAAAAaAAGAGCCTGAAAAACAGAAACAAAATGCAGCAAAAATAATCCCTACGG 

AAAT AAGGG AATC AAATC AAATTC C AJ^J\AGC ATTG AGAAGT AC T AC AAGC AAGC TT ATC CG ATTC AAAC ATTC AC 
TC TTG ATTTT AGC ATC AC AAGAG AAAAGG AATTTC T AAAAC C AG AAG AT AAAATC TTGC CC AC AC AGGGG AAAGTG 
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GAGTCTTTGAGCATCTTAATAAATAAAAAATTGTTAGACTTTAAAGCCCCAGAAAATCCAAAAAGCTCAACTTTAA 
AAAATTTCAAAGAAATTAAAAATATTGAGAATTTCTTCCAAAATCAAGACTTATTATTTGTCTTAACCCTTAAAGA 
TAAAAATAACAACAACACTATTAACATCATGCTGAATGCCCCAAACGACATCCAAAAACCCAAAGATTATATTTTA 
AAAGACCTTAAAGACACAATTAAAAAGGGTACTGGTGAGAAATACTTAAATCCTATCTATAGATTTCAAATAAAAA 
ACAAAAAAGATTATCATTCAATAGATTACAACAAAGTGACTATTAGCGAAAAAACAATAGAATTGGACCTACTGCC 
TCACGAACAAGTCTTTCAAATGAATAAAAATTTCACTAAA 

f4-15.aa ' 

MSKKVILILL EILILSCDLS INKEQKTKEK TSEKQESEKQ NIEKQEPEKQ KQNAAKIIPT 

VSIQTVEIRE SNQIPKSIEK YYKQAYPIQT FTLDFSITRE KEFLKPEDKI LPTQGKVESL 

SILINKKLLD FKAPENPKSS TLKNFKEIKN- IENFFQNQDL LFVLTLKDKN NNNTINIMLN 

PPNDIQKPKD YILKDLKDTI KKGTGEKYLN .PIYRFQIKNK KDYHSIDYNK VTISEKTIEL 

DLLPHEQVFQ MNKNFTKILD TITDLNNLKL VIQKELV 

t4-15 .aa 

CPLSINKEQKTKEKTSEKQESEKQNIEKQEPEKQKQNAAKIIPTVSIQTVEIRESNQIPKSIEKYYKQAYPIQTFT 
LDFSITREKEFLKPEDKILPTQGKVESLSILINKKLLDFKAPENPKSSTLKNFKEIKNIENFFQNQDLLFVLTLKD 
KNNNNTINIMLNPPNDIQKPKDYILKDLKDTIKKGTGEKYLNPIYRFQIKNKKDYHSIDYNKVTISEKTIELDLLP 

HEQVFQMNKNFTK 
f 4-50 .nt 

TAGAAGGAGG AAAAAATGAA' AATTGGAAAG CTAAATTCAA TAGTTATAGC CTTGTTTTTT 

AAACTATTGG TCGCATGTAG TATTGGATTA GTAGAAAGAA CAAATGCAGC TCTTGAATCG 

TCCTCTAAGG ATTTAAAAAA CAAAATTTTA AAAATAAAAA AAGAAGC C AC GGGAAAAGGT 

GTACTTTTTG AAGCTTTTAC AGGTCTTAAA AC C GGTTC C A AGGTAACAAG TGGTGGACTA 

GCCTTAAGAG AAGCAAAAGT ACAAGCCATT GTTGAAACAG GAAAGTTCCT TAAGATAATA 

GAAGAAGAAG CTTTAAAGCT TAAAGAAACT GGAAACAGTG GTCAATTCTT GGCTATGTTT 

GACTTAATGC TTGAGGTTGT AGAATCGCTA GAAGACGTTG GAATAATAGG CTTAAAAGCC 

CGTGTTTTAG AGGAATC TAA AAATAATCCT ATAAACACAG C TGAAAG ATT GCTTGCGGCT 

AAAGCTCAAA- T AG AAAATC A ACTTAAAGTG GTTAAGGAAA AACAAAATAT TGAAAATGGT 

GGAGAGAAAA AAAATAATAA AAGCAAAAAA AAGAAATAA 

t4-50.nt 

ATGTAGTATTGGATTAGTAGAAAGAACAAATGCAGCTCTTGAATCGTCCTCTAAGGATTTAAAAAACAAAATTTTA 

AAAATAAAAAAAGAAGCCACGGGAAAAGGTGTACTTTTTGAAGCTTTTACAGGTCTTAAAACCGGTTCCAA^ 

C AAGTGGTGGAC T AGC C TT AAG AGAAGC AAAAGTAC AAGCC ATTGTTGAAAC AGG AAAGTTC CTTAAGAT AAT AGA 

AG AAGAAGC TTTAAAQCTTAAAGAAACTGGAAACAGTGGTCAATTCTTGGCTATGTTTG AC TTAATGCTTGAGGTT i 

GTAGAATCGCTAGAAGACGTTGGAJITAATAGGCTTAAAAGCCCGTGTTTTAGAGGAATCTAAAA^ 

ACACAGCTGAAAGATTGCTTGCGGCTAAAGCTCAAATAGAAAATCAACTTAAAGTGGTTAAGGAAAAACAAAATAT 

TG AAAATGGTGGAGAGAAAAAAAAT AAT AAAAGC AAAAAAAAGAAA 
f4-50.aa 

KEEKMKIGKL NSIVIALFFK LLVACSIGLV ERTNAAL ESS SKDLKNKILK I KKEATGKGV 
LFEAFTGLKT GSKVTSGGLA LREAKVQAIV ETGKFLKIIE EEALKLKETG NSGQFLAMFD 
LMLEWESLE DVGIIGLKAR VLESSKNNPI NTAERLLAAK AQIEKQLKW KEKQMIENGG 
EKKNNKSKKK K 



t4-50 .aa 
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CSIGL : /3?-TNAAIjESSSKDLKNKILKIKKEATGKGVLFEAFTGLKTGSKVTSGGLALREAKVQAIVETGKFLKIIE 

EiALKIuKETGNSGQFIJ^FDLMLEVVESLEDVGIIGLKARVLEESKNNPIOT 

ENGG Z2GGTNK S KKXK " 



TAATTTTTAA AATTTAAATA TTTACATAAT AGTAATGTGT GTGGG AG AC G TATGAAAAAT 

ATTTTATTAT TTGTTATTTT ATTATTCTTT TCTTGTAAAG AATTTAATTA TTCTGATCTT 

AGGAGAAGGC CTTCAAAGGT TTTAAATGCT TC T AATGGTG CATCAAATAA AGAACTTAAA 

ATTTCTTTTG TA.GATTCTTT AAATGATGAT CAAAAAGAAG CTTTGTTTTT TCTTGAACAG 

GTAGTTCTTG ATAGCAATCC CGACAAGTTT AATCAAATTT TTAATTTAAA TGAAGAGAAG 

GTAAAAGAAA TGCTTGTTAC TGTTGTTAAG ' TGTTTAAAGG CCAAAAGAAA GGC TAAAATG 

GCTCTTGAGA GCTCAAATGT TGCAAATGTT GCCAATGCTA AACAGCAATT GCTACAGGTT 

GAAAAAACTT ACATAGATAA TTTGCGACAA TCTTTTATGA CTACTAAAAA CATTGAAGAG 

GCTTGTAATC TTGTAAAAAA TTATGATGCA TCTGCTTCGT TTTAA 

c4 - 6 c . n~ . 

TTGTAAAGAATTTAATTATTCTGATCTTAGGAGAAGGCCTTCAAAGGTTTTAAATGCTTCTAATGGTGCATCAAAT 
' £ A AlGAAJCTTAAA ATTTCTTTTGTAGATTCTTTAAATGATGATC AAAAAGAAGCTTTGTTTTTTCTTGAACAGGTAG 
TTCTTGATAGCAATCCCGACAAGTTTAATCAAATTTTTAATTTAAATGAAGAGAAGGTAAAAGAAATGCTTGTTAC 
TGTTGTTAJVGTGTTTAAAGGCCAAAAGAAAGGCTAAAATGGCTCTTGAGAGCTCAAATGTTGCAAATGTTGCCAAT 
GCTAAACA.GCAATTGCTACAGGTTGAAAAAACTTACATAGATAATTTGCGACAATCTTTTATGACTACTAAAAACA 
TTGAAGAGGCTTGTAATCTTGTAAAAAATTATGATGCATCTGCTTCGTTT 

f 4-6 5 . aa - 

FLKFKYLHNS NVCGRRJMKNI LLFVILLFFS CKEFNYSDLR RRPSKVLNAS NGASNKELKI 

SFVDSLNDDQ KEALFFLEQV VLDSNPDKFN QIFNLNEEKV KEMLVTVVKC LKAKRKAKMA 

LESSNVANVA NAKQQLLQVE KTYIDNLRQS FMTTKNIEEA CNLVKNYDAS ASF . 



CKZFJr^SDLRRRPSKVLNASNGASNKELKISFVDSLNDDQKEALFFLEQVVLDSNPDKFNQIFNLNEEKVKEMLVT 
WKC LKAKRKAKMALES S NVANVANAKQQ LLQVEKTYIDNLRQSFMTTKNIEEACNLVKNYDASASF 

f 42-1 .nt: • 

TAATTATTAA AATCTAAGGA GAAGAGATTT ATGAACAAAA AATTTTCTAT TTCATTATTA 
TCTACAATAT TAGCCTTCTT GTTAGTATTA GGTTGTGATT TGTCAAGCAA TAATGCTGAA - 
AACAAAATGG ATGATATTTT TAATTTAGAA AAGAAATACA TGGATAATTC AAATTATAAA 
TGTTTAAGTA AAAATGAGGC TATAGTTAAA AATTCTAAAA TTAAATTAGG . TGTAAATAAT 
ACTAGAAGTC GTTCTTATTC TTCTAGAGAG ACTAATGTTT CGGATTCCTA TAATAAAACC 
TATTCATATT GCAAAAGCAA CTGA 

t42-l .n: • . . ' 

TTCTGATTTGTCAAGCAATAATGCTGAAA^CAAAATGGATGATATTTTTAATTTAGAAAAGAAAT 
TCAJLiTTATAAATGTTTAi^.GTAAAAATGAGGCTATA-GTTAAAAATTCTAAAA^ 

GA-AGTCGTTCTTATTCTTCTAGAGAGACTAATGTTTCGGATTCCTATAATAAAACCTATTCATATTGCAAAAGCAA 
C 

£42-1 /'aa 



LLK5XZKRFM NKKFSISLLS TILAFLLVLG CDLSSKNAEN KMDDIFNLEX KYMDNSNYKC 
LSXNZAIVKN SKIKLGVNNT RSRSYSSRET NVSDSYNKTY SYCKSN 
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t42-l .aa 

CDLSSNNAENKMDDIFNLEKKYMDNSOT 
f43-3.nt 

TGAATATTAA TAATAAAAAA AGGAATAANA ATG AAAATTA * TCAACATATT ATTTTGTTTA 
TTTTTACTAA TGCTAAACAG CTGTAATTCT AATGATACTA AT AC TAGC C A AACAAAAAGT 
AGACAAAAAC GTGATTTAAC CCAAAAAGAA GCAACACAAG AAAAACCAAA ATCTAAAGAA 
GACCTGCTTA G AG AAAAGC T ATCTGAAGAC CAAAAAACAC ATCTTGACTG GTTAAAAACC 
GCTTTAACTG GTGCTGGAGA ATTTGATAAA TTTTTAGGAT ATGACGAAGA CAAAATAAAA 
GGTGCACTTA ATCATATAAA -GAGTGAACTT GATAAGTGTA CTGGGGATAA TTCTGAA'CAA 
CAAAAAAGCA CCTTCAAAGA GGTGGTTAAG GGGGCTCTTG GTGGCGGTAT AGATAGTTTT 
GCAACTAGTG CAAGTAGTAC CTGCCAAGCT CAGCAATAA 

m ' 

isil 

if'} ' t43-3.nt 

h j CTGTAATTCTAATGATACTAATACTAGCCAAACAAAAAGTAGACAAAAACGTGAT-TTAACCCAAAAAGAAGCAACA 
CAAGAAAAACCAAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTA^CTGAAGACCAAAAAACACATCTTGACTGGT 
TAAAAACCGCTTTAACTGGTGCTGGAGAATTTGATAAATTTTTAGGATATGACGAAGACAAAATAAAAGGTGCACT 
TAATCATATAAAGAGTGAACTTGATAAGTGTACTGGGGATAATTCTGAACAACAAAAAAGCACCTTCAAAGAGGTG 
GTTAAGGGGGCTCTTGGTGGCGGTATAGATAGTTTTGCAACTAGTGCAAGTAGTACCTGCCAAGCTCAGCAA 



□ 

f-U 
W 
(21 

5; 

13 

a:? 

'I) 



f43-3 .aa 



ILIIKKGIXM KIINILFGLF LLMLNSCNSN DTNTSQTKSR QKRDLTQKEA TQEKPKSKED 

LLREKLSEDQ KTHLDWLKTA LTGAGEFDKF LGYDEDKIKG ALNHIKSELD KCTGDNSEQQ 

'■^1 KSTFKEWKG ALGGG IDS FA TSASSTCQAQ Q 1 

a ' . . . ■ ' . - ' 

M t43-3.aa 

CNSNDTNTSQTKSRQKRDLTQKEATQEKPKSKEDLLREKLSEDQKTHLDWLKTALTGAGEFDKFLGYDEDKIKGAL 
NHIKSELDKCTGDNSEQQKSTFKEWKGALGGGIDSFATSA'SSTCQAQQ 

f45-2.nt 

TAGGAGAGAA TAATTATGAA TAAAAAAACA TTGATTATTT GTGCTGTTTT TGCGCTGATA 

ATTTC TTGC A AG AATTTTGC AACTGGTAAA GAT AT AAAAC AAAATTCAGA AGGGAAAATT' 

AAAGGATTTG TAAATAAGAT TTTAGATCCA GTAAAGGATA AAATTGCTTC AAGTGGTACA 

AAAGTAGATG AAGTAGCAAA AAAATTACAA GAAGAAGAAA AAGAAGAATT AATGCAGGGC . 

' GATGATCCTA ATGGGAGTGG AATAAATCCG CCACCAGTAT TGCCGGAAAA TATTCACAAT 

AATGCATTAG TATTAAAAGC AATAGAACAA AGTGATGGTC AACAAGAAAA AAAAGTAGAA 

GAAGCTGAAG CTAAAGTTGA AGAAAATAAA GAAAAACAAG AGAATACAGA AGAAAACATT 

AAAGAAAAAG AAATAATAGA CGAACAAAAC AAACAAGAAT TAGCTAAAGC TAAAGAAGAA 

GAACAACAAA AAGAACAAAA AAGACATCAA GAAGAGCAAC AAAGAAAAGC TAAAGCAGAA 

AAAGAAAAAA GAGAAAGAGA AGAGGCAGAA CAACAAAAAC GACAACAAGA AGAGGAAGAA 

AAAAGGCAAG TTGATAACCA AATTAAAACA CTTATAGCTA AAATAGATGA GATCAATGAA 

AATATTGATG TTATAAAATG GC AAACGAC T GTAGGCCCAC AAGGC GTT AT AGATAGAATT 

AC TGGGCC TG TGTATGATGA TTTTACCAAT GGCAATAATT CTATACGCGA AAC TTGGG AG 

GGGTTAGAAG AGGAATCAGA AGACGAAGGA TTAGGAAAAT TATTGAAAGA ATTGAGTGAT 

GCTAGGGACG CGCTAAGAAC TAAATTAAAT GAAGGCAATA AAC CAT AT AC TGGTTACGAA 

GAGCCTAAGT . TAAAAGAAAG TGTAAATGTT AGCGAAATTA AAGAAGATTT AGAAAAATTA 

AAATCAAAAT TAGAAGAAGT TAAAAAATAT CTTAAAGATA GTTCTAAATT TGAAGAAATT 
AAAGGATACA TCAGTGACAG TCAGTAA 

t45-2 .nt 
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TTGCAAGAATTTTGCAACTGGTAAAGATATAAAACAAAATTCAGAAGGGAA^TTAAAGGATTTGTAAATAAGATT 

TTAGATCCAGTAAAGGATAAAATTGCTTCAAGTGGTAGAAAAGTAGATGAAGTAGCAAAAAAATTACAAGAAGAAG 

AAAAAGAAGAATTAATGCAGGGCGATGATCCTAATGGCAGTGGAATAAATCCGCCACCAGTATTGCCGGAAAATAT 

TCACAATAATGCATTAGTATTAAAAGCAATAGAACAAAGTGATGGTCAACAAGAAAAAAAAGTAGAAGAAGCTGAA 

GCTAAAGTTGAAGAAAATAAAGAAAAACAAGAGAATACAGAAGAAAACATT^ 

AAAACAAACAAGAATTAGCTAAAGCTAAAGAAGAAGAACAACAAAAAGAACAAAAAAG 

AAGAAAAGC TAAAGC AGAAAAAGAAAAAAGAGAAAGAGAAGAGGC AG AAC AAC AAAAACG AC AAC AAGAAG AGGAA 
GAAAAAAGGCAAGTTGATAACCAAATTAAAACACTTATAGCTAAAATAGATGAGATCAATGAAAATATTGATGTTA 
TAAAATGGCAAACGACTGTAGGCCCAGAAGGCGTTATAGATAGAATTACTGGGCCTGTGTATGATGATTTTACCAA 
TGGCAATAATTCTATACGCGAAACTTGGGAGGGGTTAGAAGAGGAATCAGAAGACGAAGGATTAGGAAAATTATTG 
AAAGAATTGAGTGATGCTAGGGACGCGCTAAGAACTAAATTAAATGAAGGCAATAAACCATATACTGGTTACGAAG 
AGCCTAAGTTAAAAGAAAGTGTAAATGTTAGCGAAATTAAAGA 

AGTTAAAAAATATCTTAAAGATAGTTCTAAATTTGAAGAAATTAAAGGATACATCAGTGAGAGTCAG 



- ERI I MNKKT L IICAVFALII SCKNFATGKD IKQNSEGKIK GFVNKILDPV KDKIASSGTK 

VDEVAKKLQE EEKEELMQGD D.PNGSGINPP PVLPENIHNN ALVLKAIEQS DGQQEKKVEE 

AEAKVEENKE KQENTEENIK EKEIIDEQNK QELAKAKEEE -QQKEQKRHQE EQQRKAKAEK 

EKREREEAEQ QKRQQEEEEK RQVDNQIKTL IAKIDEINEN IDVIKWQTTV GPQGVIDRIT 

GPVYDDFTNG NNSIRETWEG LEEESEDEGL GKLLKELSDA RDALRTKLNE GNKPYTGYEE 

PKLKESVNVS EIKEDLEKLK SKLEEVKKYL KDSSKFEEIK GYISDSQ 



CKNFATGKDIKQNSEGKIKGFVNKILDPVKDKIASSGTKVDEVAKKLQEEEKEELMQGDDPNGSGINPPPVLPENI 
HNNALVLKAIEQSDGQQEKKVEEAEAKVEENKEKQENTEENIKEKEIIDEQNKQELAKAKEEEQQKEQKRHQEEQQ- 
RKAKAEKEKREREEAEQQKRQQEEEEKRQVDNQIKTLIAKIDEINENIDVIKWQTTVGPQGVIDRITGPVYDDFTN 
GISTNSIRETWEGLEEES EDEGLGKLLKELSDARDALRTKLNEGNKPYTGYEE PKLKESVNVS EIKEDLEKLKSKLEE 
VKKYLKDSSKFEEIKGYISDSQ ' 



CTGTAATTCCAATGATAATGACACTTTAAAAAACAATGCCCAACAAACAAAAAGCAGGAAAAAACGTGATTTAAGC 
CAAGAAGAACTGCCACAACAAGAAAAAATCACTTTAACATCCGACGAAGAAAAAATGTTTACTTCATT^ 
TGTTTAAATACACAATTGAAAAATTAAACAATGAAATACAAGGGTGCATGAATGGAAACAAAAGTAAATGTAAT 
CTTCTTTGATTGGCTTTCTGAAGATATT 

CAAAAACAAAAAGAATTAGCTGGTGCTTTTACCAAGGTTTACAACTTCTTAAAATCAAAAGCACAAAA^ 
TTGATACTTATATTAAAGGAGCTATTGATTGTAAAAAAAAXACTCCACAAGATTGTAATAAAAATAATGAA 



f45-2 .aa 



t45-2 .aa 



f47-2 .nt 



TGAATATTAA 
TCTTTGCTAC 
CAACAAACAA 
AAAATCACTT 
TACACAATTG 
TGTAATGACT 
GCTTTTACCA 
TATATTAAAG 
GAAATATGGG 



TAATAAAAAA AGGAGTAACA ATGAAAATCA TCAACATATT ATTTTGTATA 
TACTAAATAG CTGTAATTCC AATGATAATG ACACTTTAAA - AAACAATGCC 
AAAGCAGGAA AAAACGTGAT TTAAGCCAAG AAG AAC TGC C ACAACAAGAA 
TAACATCCGA CGAAGAAAAA" ATGTTTACTT CATTAATCAA TGTGTTTAAA 
AAAAATTAAA CAATGAAATA C AAGGGTGC A TGAATGGAAA CAAAAGTAAA 
TCTTTGATTG GCTTTCTGAA GATATTCAAA AACAAAAAGA ATTAGCTGGT 
AGGTTTACAA CTTCTTAAAA TCAAAAGCAC AAAATGAAAC TTTTGATACT 
GAGCTATTGA TTGTAAAAAA AACACTCCAC AAGATTGTAA TAAAAATAAT 
GAGGTGGACA ACTTANTAGN GCAATATTTT AG 



t47-2 .nt 



f 47-2 ..aa 



I L I I KKGVTM KIINILFCIS . LLLLNSCNSN DNDTLKNNAQ QTKSRKXRDL SQEELPQQEK 
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ITLTSDEEKM FTSLINVFKY TIEKLNNEIQ GCMNGNKSKC NDFFDWLSED IQKQKELAGA 
FTKVYNFLKS KAQNETFDTY IKGAIDCKKN TPQDCNKNNE IWGGGQLXXA IF 

t47-2.aa 

CNSNDNDTLKNNAQQTKSRKKRDLSQEELPQQEKITL^ 

FFDWLSEDIQKQKELAGAFTKVYNFLKSKAQNETFDTYIKGAIDCKKNTPQDCNKNNE 
f49-2.nt 

. TAAATGTTCA AAACAATCAT . TAAACAAAAA AATATGAAAA AAATTTCAAG TGCAATTTTA 
TT AAC AACTT TCTTTGTTTT TATTAATTGT AAAAGCCAAG TTGCTGATAA GGCGAGTGTG 
ACGGGGATTG CTAAGGGAAT AAAGGAGATT GTTGAAGCTG CTGGGGGGAG TGAAAAGCTG 
AAAGTTGCTG CTGCTGAAGG GGAGAATAAT GAAAAGGCAG GGAAGTTGTT TGGGAAGGCT 
GGTGC TGGT A ATGCTGGGGA CAGTGAGGCT GCTAGCAAGG CGGCTGGTGC TGTTAGTGCT '• 
GTTAGTGGGG AGCAGATATT AAGTGCGATT GTTAAGGCTG CTGGTGAGGC TGCGCAGGAT 
GGAGAGAAGC CTGGGGAGGC TAAAAATCCG ATTGCTGCTG CTATTGGGAA GGGTAATGAG 
GATGGTGCGG AGTTTAAGGA TGAGATGAAG AAGGATGATC AGATTGCTGC TGCTATTGCT 
TTGAGGGGGA TGGCTAAGGA TGGAAAGTTT GCTGTGAAGA ATGATGAGAA AGGGAAGGCT 
GAGGGGGCTA TTAAGGGAGC TGGCGAGTTG TTGGATAAGC TGGT AAAAG C TGTAAAGACA 
GCTGAGGGGG CTTCAAGTGG TACTGCTGCA ATTGGAGAAG TTGTGGCTGA TGATAATGCT 
GCGAAGGTTG CTGATAAGGC GAGTGTGAAG GGGATTGCTA AGGGGATAAA GGAGATTGTT 
GAAGCTGCTG GGGGGAGTAA AAAGCTGAAA GTTGCTGCTG CTAAAGAGGG CAATG AAAAG 
GCAGGGAAGT TGTTTGGGAA AGTTGATGCT GCTCATGCTG GGGACAGTGA GGCTGCTAGC 
AAGGCGGCTG GTGC TGTT AG TGCTGTTAGT GGGGAGCAGA T ATT AAGTG C GATTGTTAAG 
GCTGCTGGTG CGGCTGCTGG TGATCAGGAG GGAAAGAAGC CTGGGGATGC TAAAAATCCG 
ATTGCTGCTG CTATTGGGAA GGGTGATGCG GAGAATGGTG CGGAGTTTAA TCATGATGGG 

. ATGAAGAAGG ATGATCAGAT TGCTGCTGCT ATTGCTTTGA GGGGGATGGC TAAGGATGGA 
AAGTTTGCTG TGAAGAGTGG TGGTGGTGAG AAAGGGAAGG CTGAGGGGGC TATTAAGGGA 
GCTGCTGAGT TGTTGGATAA GCTGGTAAAA GCTGTAAAGA CAGCTGAGGG GGCTTCAAGT 
GGTACTGATG CAATTGGAGA AGTTGTGGCT AATGCTGGTG CTGCAAAGGT TGCTGATAAG 
GCGAGTGTGA CGGGGATTGC TAAGGGGATA AAGGAGATTG TTGAAGCTGC TGGGGGGAGT 
GAAAAGCTGA AAGTTGC TGC TGCTACAGGG GAGAGTAATA AAGGGGCAGG GAAGTTGTTT 
GGGAAGGCTG GTGC TGGTGC TAATGCTGGG GACAGTGAGG CTGCTAGCAA GGCGGCTGGT 
GCTGTTAGTG C TGTT AGTGG GGAGCAGATA TTAAGTGCGA TTGTTAAGGC TGCTGATGCG 
GCTGATCAGG AGGGAAAGAA GCCTGGGGAT GCTANAAATC CGATTGCTGC - TGCTATTGGG 
AAGGGTNATG NGG AGAATGG TGCGGAGTTT AANNATGANG GATGA 

o 

t49-2.nt - *.-*-. 

TTGTAAAAGCCAAGTTGCTGATAAGGCGAGTGTGACGGGGATTGCTAAGGGAATAAAGGAGATTGTTGAAGCTGCT 
GGGGGGAGTGAAAAGCTGAAAGTTGCTGCTGCTGAAGGGGAGAATAATGAAAAGGCAGGGAAGTTGTTTGGGAAGG 
CTGGTGCTGGTAATGCTGGGGACAGTGAGGCTGCTAGCAAGGCGGCTGGTGCTGTTAGTGCTGTTAGTGGGGAGCA 
GATATTAAGTGCGATTGTTAAGGCTGCTGGTGAGGCTGCGCAGGATGGAGAGAAGCCTGGGGAGGCTAAAAATCCG 
ATTGCTGCTGCTATTGGGAAGGGTAATGAGGATGGTGCGGAGTTTAAGGATGAGATGAAGAAGGATGATCAGATTG 
CTGCTGCTATTGCTTTGAGGGGGATGGCTAAGGATGGAAAGTTTGCTGTGAAGAATGATGAGAAAGGGAAGGCTGA 

GGGGGCTATTAAG 
£49-2 .aa " 

MFKTIIKQKN MKKISSAILL TTFFVFINCK SQVADKASVT GIAKGIKEIV EAAGGSEKLK 
VAAAEGENNE KAGKLFGKAG AGNAGDSEAA SKAAGAVSAV SGEQILSAIV KAAGEAAQDG 
EKPGEAKMPI AAAlIGKGNED GAEFKDEMKK DDQIAAAIAL RGMAKDGKFA VKNDEKGKAE 
• GAIKGAGELL • DKLVKAVKTA EG AS SGTAAI G E W AD DN AA KVAD KAS VKG IAKGIKSIVE 
AAGGSKKLKV AAAKEGNEKA GKLFGKVDAA HAGDSS^ASK AAGAVSAVSG EQILSAIVKA 
AGAAAGDQEG KKPGDAKNPI AAA I G KGDAE NGAEFNHDGM KKDDQ I AAA I ALRGMAXDGK 
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FAVKSGGGEK GKAEGAI KG A AELLDKLVKA VKTAEGASSG TDAIGEWAN AGAAKVADKA 
SVTGIAKGIK E I VEAAGGS E KLKVAAATGE SNKGAGKLFG KAGAGANAGD SEAASKAAGA 
VSAVSGEQIL SAIVKAADAA DQEGKKPGDA XNP I AAAIGK GXXENGAEFX XXG 

t49-2.aa 

CKSQVADKAS VTGI AKG I KE I VEAAGG S EKLKVAAAEGENNEKAGKLFGKAGAGNAGDS EAASKAAGAVS AVSGEQ 
ILSAIVKAAGEAAQDGEKPGEAKNPIAAAIGKGNEDGAEFKDEMKKDDQ 
GAIK 

f5-14.nt 

TAGAAATTCA AAACAAAGGA 
GCTGTTTTTG CACTTATAAT 
AATGCAAAAG GGAAAATTAA 
ATTACTTCAA GT AGTT C AAA 
GATAATGAAT TAATGCAGGG 
TTGCCGGAAA ATAGTCATGA 
GGTGGTCAAC AAGAAGACCA 
GAAGTTGTAG AGGAGAAAAA 
AGTCAAAAGC AAAAAGAAGA 
GCAAAAGCTA GAGCAGATAG 
CAACAGGAAG AAGCTAGGGT 
AAACAAGAAG AAGAAAAGAA 
GAAATAAATA AGGATATTGA 
ATAGATAAAA TTACGGGGCC 
AAAAC TTGGG GAGATTTAGA 
TTGAGTGATA CTAGACATAA 
GTTCTAGAAA AGGAGCCTAA 
TTAGAAAAAT TAAAATCAGG 
TTTGAAGAAA TTAAAGGATA 

t5-14.nt 

TTGTAAAAATTATGCAACTGGTAAAGATATAAAACAAAATGCAAAAGGGAAAATTAAAGGATT 
TTAGATCCAGCAAAAGATAAAATTACTTCAAGTAGTTCAAAAGTAGATGAATTAGCAA 

ATGAAGATAATGAATTAATGCAGGGCGATGATCCTAATAACAGAGCAATAGCACTGTTACCAGTATTGCCGGAAAA 
TAGTCATGACAATCCACCAGTACCAAAAGTAAAAGCAGCAGCACAAAGTGGTGGTCAACAAGAAGACCAAAAAGCA 
'AAAGAATCTAAAGATAAAGTTGAGGAAGAAAAAGAAGTTGTAGA 

AAAAAGTGGAGAAGCAAAGTCAAAAGCAAAAAGAAGAAGAGAGAAACTCTAAAGAAGAACAACA 

AGC AAAAGC T AG AGC AGAT AGAGAAAGAGAAGAACGAC T AAAAC AACAAGAAC AAAAAAG AC AACAGGAAGAAGC T 

AGGGTTAAAGCAGAAAAAGAAAAACAAGAAAGAGAGGAACAACAAAAACAAGAAGAAGAA ' 

AAATTAAAACACTTACAGACAAAATAGATGAAATAAATAAGGATATTGATGGTATAAATGGTAAAACAATTGTAGG 

AGC AGAAG AAGTT AT AGATAAAATT ACGGGGC C TGT AT ATGATGATTTT AC TGATGGGAAT AAAGC TAT AT AC AAA 
ACTTGGGGAGATTTAGAGGATGAAGAAGGCGAAGAATTAGGAAAATTATTGAAAGAATTGAGTGATACTAGACATA 
ATTTAAGAACCAAATTAAATGAGGGTAATAAAGCATATATTGTTCTAGAAAAGGAGCCTAATTTAAAAGAAAATGT 
AAATGTTAGTGATATTCAATCAGATTTAGAAAAATTAAAATCAGGATTAGAAGAAGTTAAAAAATATTTTGAAAAT 
GAAGATAATTTTGAAGAAATTAAAGGATACATTGAGGATAGTAATTCATAT 

f5-14.aa ' . 

KFKTKEKTKS MNKKILIIFA VFALIISCKN YATGKDIKQN AKGKIKGFLD KVLDPAKDKI 

TSSSSKVDEL AKKLQEEDED NELMQGDDPN NRAIALLPVL PENSHDNPFV PKVKAAAQSG 

GQQEDQKAKE SKDKVEEEKE WEEKKEEQD SKKEKVEKQS QKQKEEERNS KEEQQKQEEA 

KARADREREE RLKQQEQKRQ QEEARVKAEK EKQEREEQQK QEEEKKVKYK IKTLTDKIDE 

INKDIDGING KTIVGAEEVI DKXTGPVYDD FTDGNKAIYK TWGDLEDEEG EELGKLLKEL 



GAAAACAAAA AGTATGAATA 
TTC TTGT AAA AATTATGCAA 
AGGATTTTTA GATAAGGTTT 
AGTAGATGAA TTAGCAAAAA 
CGATGATCCT AATAACAGAG 
CAATCCACCA GT AC C AAAAG 
AAAAGCAAAA GAATCTAAAG 
AGAAGAACAA GATAGTAAAA 
AGAGAGAAAC TCTAAAGAAG 
AGAAAGAGAA GAACGACTAA 
TAAAGCAGAA AAAGAAAAAC 
AGTTAAATAT AAAATTAAAA 
TGGTATAAAT GGTAAAACAA 
TGTATATGAT GATTTTACTG 
GGATGAAGAA GGCGAAGAAT 
TTTAAGAACC AAATTAAATG 
TTTAAAAGAA AATGTAAATG 
ATTAGAAGAA GTTAAAAAAT 
CATTGAGGAT AGTAATTCAT 



AAAAAATATT GATTATTTTT 
CTGGTAAAGA TATAAAACAA 
TAGATCCAGC AAAAGATAAA 
AATTACAAGA AGAAGATGAA 
CAATAGCACT GTTACCAGTA 
TAAAAGCAGC AG C AC AAAGT 
ATAAAGTTGA GGAAGAAAAA 
AAGAAAAAGT GGAGAAGCAA 
AACAAC AAAA ' AC AAGAAGAA 
AACAACAAGA ACAAAAAAGA 
AAGAAAGAGA. GGAACAACAA 
CACTTACAGA CAAAATAGAT 
TTGTAGGAGC AGAAGAAGTT 
ATGGGAATAA AGCTATATAC 
TAGGAAAATT ATTGAAAGAA 
AGGGTAATAA AGCATATATT 
TTAGTGATAT TCAATCAGAT 
ATTTTGAAAA TGAAGATAAT 
ATTGA 
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SDTRHNLRTK LNEGNKAYIV LEKEPNLKEN VNVSDIQSDL EKLKSGLEEV KKYFENEDNF 
EEIKGYIEDS NSY 

t5-14.aa . 

CKI^ATGKDIKQNAKGKIKGFLDKVLDPAKDKITSSSSKVDELAKKLQEEDEDNELMQGDDPNNRA 
SHDNPPVPKVKAAAQSGGQQEDQKAKESKDKA/EEEKEVV^ 

AKARADREREERLKQQEQKRQQEEARVKAEKEXQEREEQQKQEEEKKVKYKIKTLTDKIDEINKDIDGINGKTIVG 
AEEVIDKITGPVYDDFTDGNKAIYKTWGDLEDEEGEELGKLLKELSDTRHNLRTKLNEGNKAYIVLEKEPNLKENV 
NVSDIQSDLEKLKSGLEEVKKYFENEDNFEEIKGYIEDSNSY ' 

fS-lS.nt 

TAACTTATGA ATAAGAAAAT GAAAATGTTT ATTATTTGTG CTGTTTTTGC ATTGATGATT 

TCTTGCAAGA ATTATGCAAG TGGTGAAAAT CTAAAAAATT CAGAACAAAA TCTAGAAAGT 

TCAGAACAAA ATGTAAAAAA AACAGAACAA GAGATAAAAA AAC AAGTTGA AGGATTTTTA 

G AAATTC TAG AGACAAAAGA TTTATCTAAA TTAGATGAAA AAG AT AC AAA AGAAATTGAA 

AAACAAATTC AAGAATTAAA GAATAAAATA GAAAAATTAG ATTCTAAAAA AACTTCTATT 

GAAACATATT CTGAGTATGA AGAAAAAATA AACAAAATAA AAGAAAAATT - GAAAGGAAAA 

GGACTTGAAG ATAAATTTAA GGAGCTTGAA GAGAGTTTAG CAAAGAAAAA GGGGGAGAGA 

AAAAAAGCTT TACAAGAGGC • CAAACAGAAA TTTGAAGAAT ATAAAAAACA AGTAGATACT 

TCAACTGGGA AAACTCAAGG CGACAGGTCT AAAAACCGAG GTGGTGTTGG AGTGCAAGCT 

TGGCAGTGTG CCAATGAATT AGGTTTGGGT GTAAGTTATT CTAATGGCGG CAGTGACAAC 

AGCAATACTG ATGAATTAGC AAACAAAGTT ATAGATGATT CTCTTAAAAA GATTGAAGAA 

GAACTTAAGG GAATAGAAGA AGATAAAAAA GAATAA 

t5-15.nt 

TTGCAAGAATTATGCAAGTGGTGAAAATCTAAAAAATTCAGAACAAAATCTAGAAAGTTCAGAACAAAATGTAA^ 
AAAACAGAACAAGAGATAAAAAAACAAGTTGAAGGATTTTTAGA^TTCTAGAGACAAAAGATTTATCTAAATTAG 
ATGAAAAAGATACAAAAGAAATTGAAAAACAAATTCAAGAATTAAAGAATAAAATA 

AACTTCTATTGAAACATATTCTGAGTATGAAGAAAAAATAAACAAAATAAAAGAAAAATTGAAAGGAAAAGGACTT 
GAAGATAAATTTAAGGAGCTTGAAGAGAGTTTAGCAAAGAAAAAGGGGGAGAGAAAAAAAGCTTTACAAGAGGCCA 
AACAGAAATTTGAAGAATATAAAAAACAAGTAGATACTTCAACTGGGAAAACTCAAGGCGACAGGTCTAAAAACCG 
AGGTGGTGTTGGAGTGCAAGCTTGGCAGTGTGCCAATGAATTAGGTTTGGGTGTAAGTTATTCTAATGGCGGCAGT 
GACAACAGCAATACTGATGAATTAGCAAACAAAGTTATAGATGATTCTCTTAAAAAGATTGAAGAAGAACTTAAGG 
GAATAGAAGAAGATAAAAAAGAA 

f5-15.aa 

LMNKKMKMFI ICAVFALMIS CKNYASGENL KNSEQNLESS EQNVKKTEQE IKKQVEGFLE 
ILETKDLSKL DEKDTKEIEK QIQELKNKIE KLDSKKTSIE TYSEYEEKIN KIKEKLKGKG - 
LEDKFKELEE SLAKKKGERK KALQEAKQKF EEYKKQVDTS TGKTQGDRSK NRGGVGVQAW 
QCANELGLGV SYSNGGSDNS NTDELANKVI DDSLKKIEEE LKGIEEDKKE 

t5-15.-aa 

r 

CKNYASGENLKNSEQNLESSEQNVKKTEQEIKKQVEGFLEILETKDLSKLDEKDTKEIEKQIQELKNKIEKLDSKK 
TSIETYSEYEEKINKIKEKLKGKGLEDKFKELEESLAKKKGERKKALQEAKQKFEEYKKQVT)TSTGKTQGDRSKNR 
GGVGVQAWQCANELGLGVSYSNGGSDNSNTDELANKVIDDSLKKIEEELKGIEEDKKE 

f 51-2 .nt 

TAATTGTTTG GGGTTGTGGT AAACTTAAGG CTTATGGAGT GGATTATGAA TAAAAAAATG 

AAAATATTTA. ' TTATTTGTGC TGTATTTGTG CTGATAAGTT CTTGCAAGAT TGATGCAACT 

GGTAAAGATG CAACTGGTAA AGATGCAACT GGTAAAGATG CAACTGGTAA AGATGCAACT 

GGTAAAAATG CAGAACAAAA TATAAAAGGG AAAGTTCAAG GATTTTTAGA AAAGATTTTA 
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GATCCAGTAA AGGATAAAAT TGCTTCAAAT GGTCCAATAG CAGATGAATT GGCAAAAAAA 

TTACAAGAAG AAGAAAAGGT AAATAACGGG GAAGAAGAAA ATGATAAAGC TGTC TTTTT A 

GGAGAAGAAT CAAAAGAGGA TGAAGAAGAA AATGAGCAAG CTGTTAATTT AGAAGAAAAA 

AATGCGGAAG AGGATAAGAA AGTTGTTAAT TTAGAAGAGA AAGAATTAGA AGTTAAAAAA 

GAGACTGAAG AAGATGAAGA TAAAGAAGAA ATAGAGAAAC AAAAACAAGA AGTGGAAAAA 

GCACAAGAAA GAAAACAACG ACAAGAAGAA AAGAAACGAA AAAAACAAGA ACAGCAAGAA 

GAAAAGAAAC GAAAACGACA AGAACAAAGA AAAGAAAGGA GAGCTAAAAA CAAAATTAAA 

AAACTTGCGG ATAAAATAGA TGAGATAAGT TGGAATATTG ATGGTATAGA AAGTCAAACA 

AGTGTAAAAC CGAAAGCAGT TATAGATAAA ATTACGGGGC CTGTATATGA TTATTTTACC 

GATGACAACA AAAAAGCTAT ATATAAAACA TGGGGAGATT TAGAAGATGA AGAAGGCGAA 

GGATTGGGAA AATTATTGAA AGAATTGAGT GATACTAGAG ATGAGTTAAG AACCAAATTA 

AATAAAGATA ATAAAAAATA TTATGCCCAT GAAAATGAGC CTCCTCTAAA AGAAAATGTA 

GATGTCAGCG AAATTAAAGA AGATTTAGAA AAAGTAAAAT CAGGATTAGA AAAGGTTAAA 

GAATATCTTA AAGACAATTC TAAATTTGAA GAAATTAAAG GATAC ATC AG TTACAGTCAG 

TAA * - . 

t51-2.nt / 

TTGCAAGATTGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCAACTGGTAAAGATGCA 

ACTGGTAAAAATGCAGAACAAAATATAAAAGGGAAAGTTCAAGGATTTTTAGAAAAGATTTTAGATCCAGTAAAGG. 

ATAAAATTGCTTCAAATGGTCCAATAGCAGATGAATTGGCAAAAAAATTACAAGAAGAAGAAAAGGTAAATAACGG 

GGAAGAAGAAAATGATAAAGCT'GTCTTTTTAGGAGAAGAATCAAAAGAGGATGAAGAAGAAAATGAGCAAGCTGTT 

AATTTAGAAGAAAAAAATGCGGAAGAGGATAAGAAAGTTGTTAAT 

AGACTGAAGAAGATGAAGATAAAGAAGAAATAGAGAAACAAAAACAAGAA 

ACGACAAGAAGAAAAGAAACGAAAAAAACAAGAACAGCAAGAAGAAAAGAAACGA 

GAAAGGAGAGCTAAAAACAAAATTAAAAAACTTGCGGATAAAAT 

AAAGTCAAACAAGTGTAAAAGCGAAAGCAGTTATAGATAAAATTACGGGGCCTGTATATGATTATTTTACCGATGA' 
CAACAAAAAAGCTATATATAAAACATGGGGAGATTTAGAAGATGAAGAAGGCGAAGGATTGGGAAAATTATTGAAA 
GAATTGAGTG AT AC TAGAG ATGAGTTAAG AACCAAATT AAA TAAAG AT AATAAAAAAT ATT ATGCCCATGAAAA.TG 
AGCCTCCTCTAAAAGAAAATGTAGATGTCAGCGAAATTAAAGAAGATTTAGAAAAAGTAAAATCAGGATTAGAAAA 
GGTTAAAGAATATCTTAAAGACAATTCTAAATTTGAAGAAATTAAAGGATACATCAGTTACAGTCAG 



f51-2.aa : 

LFGVWNLRL MEWIMNKKMK IFIICAVFVL ISSCKIDATG KDATGKDATG KDATGKDATG 

KNAEQNIKGK VQGFLEKILD PVKDKIASNG P I ADELAKKL QEEEKVNNGE -EENDKAVFLG 

EESKEDEEEN EQAVNLEEKN AEEDKKWNL EEKELEVKKE TEEDEDKEEI EKQKQEVEKA 

QERKQRQEEK KRKKQEQQEE KKRKRQEQRK ERRAKNKIKK LADKIDEXSW NIDGIESQTS 

VKPKAVIDKI TGPVYDYFTD DNKKAIYKTW GDLEDEEGEG LGKLLKELSD TRDELRTKLN 

KDNKKYYAHE NEPPLKENVD VSEIKEDLEK " VKSGLEKVKE YLKDNSKFEE IKGYISYSQ 

t51-2 .aa * ^ ' 

CKIDATGKDATGKDATGKDATGKDATGKNAEQNIKGKVQGFLEKILDPVKDKIASNGPIADELAKKLQEEEKVNNG 
EEENDKAVFLGEESKEDEEENEQAVNLEEKNAEEDKXVVNLEEKELEVKKETEEDEDKEEIEKQKQEVEKAQ 
RQEEKKRKKQEQQEEKK^RQEQRKERRAKNKIKKLADKIDEISWNIDGIESQTSVKPKAVIDKITGPVYDYFTDD 
NKKAIYKTWGDLEDEEGEGLGKLLKELSDTRDELRTKLNKDNKKYYAHENEPPLKENVDV-SEIKED 
VKEYLKDNSKF EE IKGYISYSQ 

f 6-21 .'nt 

TAGGCAAAAT TTAAATTTAT AAAAACTTGT AAGGATGCTT GTATGAAAAT ATTGATAAAA 

AAGTTAAAAG TTGTATTATT TCTCAATTTA ATTTTACTTA TTTC TTGTGT TAATGAAAGT 

AATAGAAACA AATTGGTTTT TAAGCTAAAT ATTGGAAGTG AGCCTGCTAC TTTAGATGCT 

CAATTAATAA' ACGATACGGT TGGATCAGGG ATTGTAAGCC AAATGTTTCT TGGCATTTTA 

GATGGAGATC CCAGGACTGG AGGATACAGA CCGGGACTTG CTAAAAGTTG GGATATTTCT 
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GATGACGGAG T AGTTT AT AC GTTTCATTTA AGAGATAATC TTGTTTGGAG TGATGGAGTT 
TCCATTACTG CCGAAGAATA A 

t6-21.nt 

TTGTGTTAATGAAAGTAATAGAAACAAATTGGTTTTTAAGCTAAATATTGGAAGTGAGCCTGCTACTTTAGATGCT 
CAATTAATAAACGATACGGTTGGATCAGGGATTGTAAGCCAAATGTTTCTTGGCATTTTAGATGGAGATCCCAGGA 
CTGGAGGATACAGACCGGGACTTGCTAAAAGTTGGGATATTTCTGATGACGGAGTAGTTTATACGTTTCATTTAAG 
AGATAATCTTGTTTGGAGTGATGGAGTTTCCATTACTGCCGAAGAA 

f6-21.aa 

AKFKFIKTCK DACMKILIKK LKWLFLNLI LLISCVNESN RNKLVFKLNI GSEPATLDAQ 
LINDTVGSGI VSQMFLGILD GDPRTGGYRP GLAKSWDISD DGWYTFHLR DNLVWSDGVS 
ITAEE ■ . ■ 

t6-21.aa 

- CVNESNRNKLVFKLNIGSEPATLDAQLINDTVGSGIVSQMFLGILDGDPRTGGYRPGLAKSWDISDDGVVYTFHLR 
DNLVWSDGVS ITAEE 

f6-27.nt 

TAAAGAAAAG CTTGCATAAA AAGTATAACA AATTCTTTAA TAATTAAAAT CAAAAAGAAT 

ATAATTATTG C AC T AAAATT AAATTTATAC AGTTATATAG AATCACTTAA GGAACAAAAA 

ATGAAATACC TT AAAAAC AT TTCCTTATTT TTGTTAATTT TAGGTTGCAA ATCCATCCCA 

AATGGTAATT TCAATCTACA CGATACAAAC CATAAATTAG GAAAACTAAA ATTTCAAGAA 

GACTCGATAA TAAGCAGAAA TTATGATAAT AAAATATCCA TTGTGGGAGT ATACAACCCT 
TTAACAGAAA AAGAAAATTT TAAAGTCAAT ATTTTCATCA AAAAAAAAGG- ATTACAAATA J 

GATCCTGAAA ATATTTTGAT AAATGAAGAA AAAATTAATT ATTCAAAATA TAAAGCAGAA 

CTCAAAGTAA AATCTAGCTT TAATAAAAGC ATTATCAGTA TTTCACTAAC TAATTCAAGA ' 

GATCTATTAA CCTACATTTA CGATAAAAGC AC AGGG AAAT ACATTAACAT TGACTTTAAG 

GACAATTGGA ACGTATCGCA CAGTATAAAA TTTAATAAGG AGTATATTTT AGCATATATA 

ACAGATTTTG ATAAAGAAAT TAAAATATCT AAAAATATTT TGCAAAAACG TATTGATAAT 

AGAAAAATTG AAATTGAAAA AACAGAGCTT AAAACAGAAT ATAATGAAAT AGAGGATTAT 

TACATCTACA GTATGAAAAT TCCAAAATTA TTTGAAAAAT -CAGACGCTCC • CTCTGAAACT 

TACGAAACAT TTGTTATAGC AAATTATTAC CCCTGTGAAA ATTTAAATAT ACTGTTTTTG 

AATTTAAGCT TATACTCTGA . TAAATTACGC TTTCTAAACT CTATTTATGA TGAGAATGAT 

AGAAAATTAA AAATGGAGCC TCCTGTGAGA GCCTTAAAGA ATTCAAAAAC AATAAAAGAA 

ACATTAAATA TAGTATTAAG TCCTCAAAAA ATAATAGAGC TAGCAAAAAA CATTGAAAAA 

GAT ATT AC TC T AAAATT AAA ATCTTACGGA GAAAAGGGAG AATTCACATT TGAAATATAT 

AAACCACTTC TTTT AAAATT CTTAAAAGAA GTAGATCATT GCATAAAAAA TTTGCAATCA 
AGTAGGCATA AATTTTAA 

t6-27.nt 

TTGCAAATCCATCCCAAATGGTAATTTCAATCTACACGATACAAACCATAAATTAGGAAAACTAAAATTTCAAGAA 
GACTCGATAATAAGCAGAAATTATGATAATAAAATATCCATTGTGGGAGTATACAACCCTTTAACAGAAAAAGAAA 
ATTTTAAAGTCAATATTTTCATCAAAAAAAAAGGATTACAAATAGATCCTGAAAATATTTTGATAAATGAAGAAAA 
AATTAATTATTCAAAATATAAAGCAGAACTCAAAGTAAAATCTAGCTTTAATAAAAGCATTATCAGTATTTCACTA 
ACTAATTCAAGAGATCTATTAACCTACATTTACGATAAAAGCACAGGGAAATACATTAACATTGACTTTAAGGACA 
ATTGGAACGTATCGCACAGTATAAAATTTAATAAGGAGTATATTTTAGCATATATAACAGATTTTGATAAAGAAAT 
TAAAAT ATC TAAAAATATTTTGC AAAAAC GT ATTGAT AAT AG AAAAATTG AAATTGAAAAAAC AGAGC TT AAAAC A 
G AAT AT AATG AAAT AG AGGATTATT AC ATCT AC AGTATGAAAATTC C AAAATTATTTGAAAAATC AGACGC TCC C T 
CTGAAACTTACGAAACATTTGTTATAGCAAATT ATT ACCCCTGTGAAAATTTAAATAT ACTGTTTTTG AATTTAAG 
CTTATACTCTGATAAATTACGCTTTCTAAACTCTATTTATGATGAGAATGATAGAAAATTAAAAATGGAGCCTCCT 
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GTGAGAGCCTTAAAGAATTCAAAAACAATAAAAGAAACATTAAATATAGTATTAAGTCCTCAAAAAATAATAGAGC 
TAGCAAAAAACATTGAAAAAGATATTACTCTAAAATTAAAATCTTACGGAGAAAAGGGAGAATTCACATTTGAAAT 
ATATAAACCACTTCTTTTAAAATTCTTAAAAGAAGTAGATCATTGCATAAAAAATTTGCAATCAAGTAGGCATAAA 
TTT 

f6-27.aa 

RKACIKSITN SLIIKIKKNI I I ALKLNLYS YIESLKEQKM KYLKNXSLFL LILGCKSJPN 
GNFNLHDTNH KLGKLKFQED SIISRNYDNK ISIVGVYNPL TEKENFKVNI FIKKKGLQID 
PENILINEEK INYSKYKAEL KVKSSFNKSI ISISLTNSRD LLTYIYDKST GKYINIDFKD 
NWNVSHSIKF NKEYILAYIT DFDKEIKISK^ NILQKRIDNR KIEIEKTELK TEYNEIEDYY 
IYSMKIPKLF EKSDAPSETY ETFVIANYYP CENLNILFLN LSLYSDKLRF LNS I YDENDR 
KLKMEPPVRA LKNSKTIKET LNIVLSPQKI IELAXNIEKD ITLKLKSYGE KGEFTFEIYK 
PLLLKFLKEV DHCIKNLQSS RHKF 

t6-27.aa 

CKS I PNGNFNLHDTNHKLGKLKFQEDS I I SRNYDNKI S I VGVYNPLTEKENFKVNI F I KKKGLQ IDPENI LINEEK 
INYSKYKAELKVKSSFNKSIISISLTNSRDLLTYIYDKSTGKYINIDFKDNWNVSHSIKFNKEYILAYITDFDKEI 
KISKNILQKRIDNRKIEIEKTELKTEYNEIED.YYIYSMKIPKLFEKSDAPSETYETFVIANYYPCENLNILFLNLS 
LYSDKLRFLNSIYDENDRKLKMEPPVRALKNSKTIKETLNIVLSPQKIIELAKNIEKDITLKLKSYGEKGEFTFEI 
YKPLLLKFLKEVDHCIKNLQSSRHKF t 

. f6-5.nt . - 

AGTTTTTAAT ATCCGTTTAT TTTTTATTGT TTTATGGTTG TTCAACTATA 

AAATACCAGA AAAAGATAAA ATAAATTTAA CTGTTTTATC ATCTTTAATG 

ATTTGAAGAT TTCAAATTTT AAAATAAAAG ACTACGAACA TTTGCATTAT 

TTGAAAGCTT GAGTGATACT AAAAATAGTG CTTATATTTA CGTTGATGAA 

ATAATAATAT TAATTTTATT AAAGATCTTT TTATTTATAA TAAGAAATTA 

TTAT.TGCTTA TAGCTTGACC CAAGGTGCAT CTTTTAAGGC AGAAGTTTTA 

AAAAACAAAA AATTATGAAA AATTTTTCAT TGAAAATAAA ' TTTTCCAACT 

TTATGGATAA TAAGTATTGG ATTGTAATTG CAAAAAACCA TTTAGATTCT 

GTAAAAATTA TTTAGTCTTG GCGAATGTAA AGATGGAATA TATACTCAAA 

CTTGA - ■ 

t6-5 .nt 

TTGTTCAACTATATCTTTGGTAAAAATACCAGAAAAAGATAAAATAAATTTAACTGTTTTATCATCTTTAATGAAT" 
TATCCTGATTTGAAGATTTCAAATTTTAAAATAAAAGACTACGAACATTTGCATTATTCATCTGATTTTGAAAGCT 
TGAGTGATACTAAAAATAGTGCTTATATTTACGTTGATGAATCTAGTTTCAATAATAATATTAATTTTATTAAAGA 
TCTTTTTATTTATAATAAGAAATTATATAGAATACTTATTGCTTATAGCTTGACCCAAGGTGCATCTTTTAAGGCA 
GAAGTTTTATCTTATCTTGAAAAACAAAAAATTATGAAAA 

AATTTATGGATAATAAGTATTGGATTGTAATTGCAAAAAACCATTTAGATTCTCTTGTTAAGAGTAAAAAT 
f6-5.aa 

MKKFLISVYF LLFYGCSTIS LVKIPEKDKI NLTVLSSLMN 'YPDLKISNFK IKDYEHLHYS 
SDFESLSDTK NSAYIYVDES SFNNNINFIK DLFIYNKKLY RILIAYSLTQ GASFKASVLS 
YLEKQXIMKN FSLKINFPTA KKFMDNKYWI VIAKNHLDSL VKSKNYLVIiA NVKMEYILKK 
FLT 

t6-5.aa 

CSTISLVKIPEKDKINLTVLSSLMNYPDLKISN^ 
LFIYNKKLYRILIAYSLTQGASFKAEVLSYLEKQKIMKNFSL^ 



TAAATGAAGA 
TCTTTGGTAA 
AATTATCCTG 
TCATCTGATT 
TCTAGTTTCA 
TATAGAATAC 
TCTTATCTTG 
GCTAAAAAAT 
CTTGTTAAGA 
AAGTTT.TTAA 
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£7-30. nt 

TAGAGACGAA GTCACAAGCA AAATGTTAAA AGATTTACAA AATCAAGTTC AAGGGGGCAA 
ATAATGAAAA ATTTAAAGAC AAAAATTAAT TTTTTAGGGA TATTTTGGCT ACTGTTACTA 
TTTCTTTCTT GCGAATCAAT ACCATCACTT CCCCAAAAAC CAACCCTAAC AAACAAAGAA 
GATATTGAAA ATTTAATGCT CGATGAAGCA GAACTTTTTA GATACTCAAC CGCACTAAAT 
GTTTGGCTTT TG AC TGT AAA ATCTTATGTG ATCAAATACT ATCCTAATGA CAAATTTCCT 
GTGTTTGAAA ATTTTGATCC CGTGTTTGGC GATGAAAATG GAACTAAAGA AACAAATATA . 
CTAAAAAATC GAATTACCTA CTACAATCGA TACATAGAAA AAACCGAACC GATTGTATTT 
GGGTGTTACA AAAAATACAG CAGAAGATAA 

t7-30.nt 

TTGCGAATCAATACCATCACTTCCCCAAAAACCAACCCTAACAAACAAAGAAGATATTGAAAATTTAATGCTCGAT 
GAAGCAGAACTTTTTAGATACTCAACCGCACTAAATGTTTGGCTTTTGACTGTAAAATCTTATGTGATCAAATACT 
ATCCTAATGACAAATTTCCTGTGTTTGAAAATTTTGATCCCGTGTTTGGCGATGAAAATGGAACTAAAGAAACAAA 
TATACTAAAAAATCGAATTACCTACTACAATCGATACATAGAAAAAACCGAACCGATTGTATTTGGGTGTTACAAA 
AAATACAGCAGAAGA 

£7-30. aa. 

RRS HKQNVKR FTKSSSRGQI MKNLKTKINF LGIFWLLLLF LSCESIPSLP QKPTLTNKED 

IENLMLDEAE LFRYSTALNV WLLTVKSYVI KYYPNDKFPV FENFDPVFGD ENGTKETNIL 

KNRITYYNRY IEKTEPIVFG CYKKYSRR 

t7-30.aa 

CESIPSLPQKPTLTNKEDIENLMLDEAELFRYSTALNVWLLTVKSYVIKYYPNDKFPVFENFDPVFGDENGTKETN 
ILKNRITYYNRYIEKTEPIVFGCYKKYSRR 

£76-1. nt . 

TGAATATTAA TAATAAAAAA AGGAGTAACA ATGAAAATTA TCAACATATT ATTTTGTTTG 

TTTTTACTAA TGCTAAACGG CTGTAATTCT AATGATACAA AT AC C AAGC A GACAAAAAGC 

AGACAAAAGC GTGATTTAAC CCAAAAAGAA GCAACACAAG AAAAACCTAA ATCTAAATCT 

AAAGAAG AC C TGCTTAGAGA AAAGCTATCT GATGATCAAA AAACACAACT TGACTGGTTA 

AAAACCGCTT TAACTGGTGT TGGAAAATTT GATAAATTCT TAGAAAATGA TGAAGGCAAA 

ATTAAATC AG ■ CACTTGAACA TATAAAGACT GAAC TTG AT A AATGTAATGG AAATGATGAA 

GGAAAAAACA CCTTCAAAAC TACCGTTCAA GGGTTTTTTA GCGGCGGCAA TATAGATAAT 
TTTGCAGATC AAGCAACTGC TACCTGCAAT TAA 

t76-l.nt 1 

CTGTAATTCT AATG AT AC AAAT AC C AAGC AGACAAAAAGCAGACAAAAGCGTGATTTAACCCAAAAAGAAGCAAC A 

CAAGAAAAACCTAAATCTAAATCTAAAGAAGACCTGCTTAGAGAAAAGCTATCTGATGATCAAAAAAC 

ACTGGTTAAAAACCGCTTTAACTGGTGTTGGAAAATTTGATAAATTCTTAGAAAATGATGAAGGCAAAATTAAATC 

AGCACTTGAACATATAAAGACTGAACTTGATAAATGTAATGGAAATGATGAAGGAAAAAACACCTTCAAAACTACC 

GTTCAAGGGTTTTTTAGCGGCGGCAATATAGATAATTTTGCAGATCAAGCAACTGCTACCTGCAAT 

£76-1. aa 

ILIIKKGVTM KIINILFCLF LLMLNGCNSN DTNTKQTKSR QKRDLTQKEA TQEKPKSKSK 
EDLLREKLSD DQKTQLDWL.K TALTGVGKFD KFLENDEGKI KSALEHIKTE LDKCNGNDEG 
KNTFKTTVQG FFSGGNXDNF ADQATATCN 



t76-l .aa 
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CNSNDTNTKQTKSRQKRDLTQKEATQEKPKSKSKEDLLREKLSDDQKTQLDWLKTALTGVGKFDKFLENDEGKIKS 
ALEHIKTELDKCNGNDEGKNTFKTTVQGFFSGGNIDNFADQATATCN 

f8-10.nt 

TAAGTAAGGA GAATATTTAT GAAATATAAT ACGATTATAA GCATATTTGT TTGTTTGTTT 

TTAACTGCTT GCAATCCAGA TTTTAACACA AATAAGAAAA GAACTCTAAG TAAGGGGATA 

ATTTCAAATC AAGATGCAGA TTCTGATAAA ATAATAAAAA ATAAATTACT TGATGATTTA 

ATAAATTTAA TAGAAAAAGC GAATGCAGAT AGAGAAAAAT ATGTAAAAAA AATGGAAGAA 

GAACCTTCGG ATCAATATGG AATGTTGGCT GTTTTTGGAG GTATGTATTG GGCAGAATCA 

CCACGGGAAT TAATATCTGA - TACAGGTAGT GAGAGATCTA TTAGGTATAG AAGGCGTGTT 

TATAGTATTT TATTAAATGC .TATTGAAACT AATGAATTAA AGAAATTTTC AGAAATTAGA 

ATACTGTCAA TAAAAGTACT AGAAATATTT AGCCTATTTA ATCTATTTGG AAGTACTCTT 

GATGATGTGG TTGTTCACTT ATATTCCAAA AAAGATACTC TAGGTAAACT AGATATTTCA 

Q AATTTAAAAA GACTTAAAAA TTTGTTTGAA AAATTATTAT C T AT AAAAAC AATCGTTTCA. 

St) ' AAGATGTCAA AACGTCTTTT ATTGGATTAT CAAAATAATG AAAATTTTAT AAAAACAGAT 

AACGCCAAGC ' TTGGATCTTA TGTGGTTGC A * CTTTCCAATC AAATTCAAGA AAAATATAAT ' 

IgJ GAAGCAGAAA GGCTGAAAAG C GAGAT AATT TTAATATATA CCCTTTAA 

f|j t8-10.nt 

W" " . " 

TTGCAATCCAGATTTTAACACAAATAAGAAAAGAACTCTAAGTAAGGGGATAATTTCAAATCAAGATGCAGATTCT 

v GATAAAATAATAAAAAATAAATTACTTGATGATTTAATAAATTTAATAGAAAAAGCGAATGCAGATAGAGAAAAAT 

ATGTAAAAAAAATGGAAGAAGAACCTTCGGATCAATATGGAATGTTGGCTGTTTTTGGAGGTATGTATTGGGCAGA 

ATCACCACGGGAATTAATATCTGATACAGGTAGTGAGAGATCTATTAGGTATAGAAGGCGTGTTTATAGTATTTTA 

TTAAATGCTATTGAAACTAATGAATTAAAGAAATTTTCAGAAATTAGXATACTGTCAATAAAAGTACTAGAAATAT 

TTAGCCTATTTAATCTATTTGGAAGTACTCTTGATGATGTGGTTGTTCACTTATATTCCAAAAAAGATACTCTAGG 

TAAACTAGATATTTCAAATTTAAAAAGACTTAAAAATTTGTTTGAAAAATTATTATCTATAAAAACAATCGTTTCA 

AAGATGTCAAAACGTCTTTTATTGGATTATCAAAATAATGAAAATTTTATAAAAACAGATAACGCCAAGCTTGGAT 

CTTATGTGGTTGCACTTTCCAATCAAATTCAAGAAAAATATAATGAAGCAGAAAGGCTGAAA 

f8-10.aa 

VRRI FMKYNT IISIFVCLFL TACNPDFNTN KKRTLSKGII SNQDADSDKI IKNKLLDDLI 
NLIEKANADR EKYVKKMEEE PSDQYGMLAV FGGMYWAESP RELISDTGSE 'RSIRYRRRVY 
SILLNAIETN ELKKFSEIRI LSIKVLEIFS LFNLFGSTLD DVWHLYSKK DTLGKLDISN 
LKRLKNLFEK LLSIKTIVSK MSKRLLLDYQ NNENFIKTDN AKLGSYWAL SNQIQEKYNE 
AERLKSEIIL IYTL 

t8-10.aa 

CNPDFNTNKKRTLSKGIISNQDADSDKIIi^KLLD^ 

SPREIilSDTGSERSIRYRRRVYSILLNAIETNELKKFSEIRILSIKVLEIFSLFNLFGSTLDDVVVHLYSKKDTLG 
KLDISNLKRLKNLFEKLLSIKTIVSKMSKRLLLDYQNNENFIKTDNAKLGSYWALSNQIQEKYNEAERLK 

f8-14.nt 

TAATATATAT TCTTGATTAA GGG AAAGGAG AGTATTTTTA TGAAAAAAAA AATGTTTTTA 
TAT AC ATTGT TAACGATAGG ATTGATGTCT TGTAATCTAA ATTCTAAATT ATCTGGTAAT 
AAAGAGGAAC AAAAAAATAA C AATGAT AT A AAAGAAGCTT TAAATGGCGT TCAAGAAAAT 
GCTATTAATA ATTTATATGG AAATAAAAAA GAAAAAAAAG ATTTTATTAA AAATTCGGAA 
AAATTGAAAG ACAAGGGTTT AGACGTGACC ACCCTCCCCT TAGAACCTGT AGTGGCGCCC 
TCCGTAGAAT CTGCGGTGTC TTTAGGAGAA TCTAATAATA GGATTGGTAT ACCAACCATT 
TCAATTGAGC ATAATCAAAA AAAAGAGATA AAAGAAGAGG ATTTTTTCCC TTCTACTGAG 
GAAGAAAAGC AAGCGGATAA AGCAATTAAA GATATAGAGA ATCTTATTGG AGAATCTGGA 



m 
si 
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TTTCCCGAGT TAATTGAGAA TGTGTGCTCA CTTAAACATG AATATACTTT AATAAGAAGT 
GATTTTTATG ATGTGATAAC TAAGATTCAG AATAAAAAAA TATCACTAAT GAAAAATTCT 
CATAATAATA GAAATAAAAT AAGGGAACTA GTACAATTGC AAAATAATTT AAAGATAGGA 
GACGAACTTG ATAAAATTAT GGGTTGCATT GATACTGCAG AACAAGAGAT AAGATCTGCC 
GCTTTCTTTT TTGATGAAGC TAAGGAAAGC TTAAAAGAAG GTATTATTAA AAGATTGGAA 
AAAAGTAAAA ATAGGGCAGC ATCACAATTA TCTAAAAAGG CTTTAAATAG AGCAGAGGAT 
GCTTTAAGGT GCTTAGAAAA TTATTCTTCT AAAAAAGGTG AGGCAATAGG AAGAAGAAGC 
TTTATAAAAG AAGTTGTTGA ACAGGCAAAA AATGCTTTAA GTAAGTCTTA A 

t8-14.nt" 

TTGTAATCTAAATTCTAAATTATCTGGTAATAAAGAGGAACAAA 
GGCGTTCAAGAAAATGCTATTAATAATTTATATGGAAATAA 

AATTGAAAGACAAGGGTTTAGACGTGACCACCCTCCCCTTAGAACCTGTAGTGGCGCCCTCCGTAGAATCTGCGGT 

GTCTTTAGGAGAATCTAATAATAGGATTGGTATACCAACCATTTCAATTGAGCATAATCAAAAAAAAGAGATAAAA 

GAAGAGGATTTTTTCCCTTCTACTGAGGAAGAAAAGCAAGCGGATAAAGCAATTAAAGATATAGAGAATCTTATTG 

GAGAATCTGGATTTCCCGAGTTAATTGAGAATGTGTGCTCACTTAAACATGAATATACTTTAATAAGAAGTGATTT 

TTATGATGTGATAACTAAGATTCAGAATAAAAAAATATCACTAATGAAAAATTCTCATAATAATAGAAATAAAAT 

AGGGAACTAGTACAATTGCAAAATAATTTAAAGATAGGAGACGAACTTGATAAAATTATGGGTTGCATTGATACTG 

CAGAACAAGAGATAAGATCTGCCGCTTTCTTTTTTGATGAAGCTAAGGAAAGCTTAAAAGAAGGTATTATTAAAAG 

ATTGGAAAAAAGTAAAAATAGGGCAGCATCACAATTATCTAAAAAGGCTTTAAATAGAGCAGAGGATGCTTTAAGG 

TGCTTAGAAAATTATTCTTCTAAAAAAGGTGAGGCAATAGGAAGAAGAAGCTTTATAAAAGAAGTTGTTGAACAGG 

CAAAAAATGCTTTAAGTAAGTCT 

f8-14.aa ■ 

YIFLIKGKES IFMKKKMFLY TLLTIGLMSC NLNSKLSGNK EEQKNNNDIK EALNGVQENA 
INNLYGNKKE KKDFIKNSEK LKDKGLDVTT LPLEPWAPS VESAVSLGES NNRIGIPTIS 
IEHNQKKEIK EEDFFPSTEE EKQADKAIKD IENLIGESGF PELIENVCSL KHEYTLIRSD 
FYDVITKIQN KKISLMKMSH NNRNKIRELV QLQNNLKIGD ELDKIMGCID TAEQEIRSAA 
FFFDEAKESL KEGIIKRLEK SKNRAASQLS KKALNRAEDA LRCLENYSSK KGEAIGRRSF 
IKEWEQAKN ALSKS 

t8-14 .aa 

CNLNSKLSGNKEEQKNNNDIKEALNGVQENAI 

SLGESNNRIGIPTISIEHNQKKEIKEEDFFPSTEEEKQADKAIKDIENLIGESGFPELIENVCSLKHEYTLIRSDF 
YDVITKXQNKKISLMKNSHNNRNKIRELVQLQNNLKIGDELDKIMGCIDTAEQEIRSAAFFFDEAKESLKEGIIKR 
LEKSKNRAASQLSKKALNRAEDALRCLENYSSKKGEAIGRRSFIKEWEQAKNALSKS 

-fOlA.nt BB001 

TGATTAATTTTTTTTAAGGATTACGTTTTGAAAAGAAACAAAATTTGGAAAACGTTAAAACTGTTTCAAATAACTT 

TACTGTTCTCATGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATTAA 

ACTTGGAAAAATTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCCAAAATCTTCAAAACTTACA^ 

CAGTTCTTTAAAAATGAAAAAGAAAAAATAATTAAAAAAATTGCACAAGAATTTGATGAGAATGi^ 

A.TAAAATAGGTCCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCA 

ATTTGGAATAAATAAAACTTTATTCA.CAGAAAAAAAAGA.CAATAATATTGACTTTATGTTAAAAGACAATCGACTT 

AGAAGATTATTTTACTCATCTTTAAATTATGATGAAAATAAAATCAAAiVAATTAGCCACAATACTCGCGCAAACAT 

CAAGCTCAAACGACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAGCATTTGA 

AAGCGCTGTTAATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAGAG 

ATTCAGGAAAATTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATTATTGGCGAAT 

ATG AC AAAAAT ACGGGAGGATGC AAAGC TG ATGG AAAAATTCTC GG AG AAGT AATAAGGGTTGG ATAGG AGC ATG A 

ACTCGACTCAAATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACAC 

TACTAA 
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tOlA.nt BB001 

TGCTCTTTTTATTCTAAATCAAACAACACAGAAGCGATAAGTGAATTACAATCAAGCCCTATTAAACTTGGAAAAA 

TTAAAGTTTTACAAAAAACAGAAAAGATTGTAAGCACCC 

AAATGAAAAAGAAAAAATAATTAAAAAAATTGCACAAGAATTTGATGAGA 

CCAAATATCGAAATGTTTGCTCAAACAATAAACACGGATATTCAAAAAATCGAACCTAATGATCAATTTGGAATAA 
ATAAAACTTTATTCACAGAAAAAAAAGACAATAATATTGACTTTATGTTAAAAGACAATCGACTTAGAAGATTATT 
TTACTCATCTTTAAATTATGATGAAAATAAAATCAAAAAATTAGCCACAATACTCGCGCAAACATCAAGCTCAAAC 
GACTACCATTACACACTTATTGGTTTAATTTTTTGGACAGGATTTAAAATCCAAGAAGCATTTGAAAGCGCTGTTA 
ATATTTTAACTAAAGACGAGCAAAAGCGCCTAATTTTTAATTTTAGAACAAAAACAGTAAAAGAGATTCAGGAAAA 
TTTTGAAAAACTAATGCAAGAGAGAAATTCATGGATAAAAATCGTCGATAACATTATTGGCGAATATGACAAAAAT 
ACGGGAGGATGCAAAGCTGATGGAAAAATTCTCGGAGAAGTAATAAGGGTTGGATACGAGCATGAACTCGACTCAA 

ATAAAAGTATGCAAATTTTAAACAATATTGAAACACCGCTAAAAACCTGTTGTGACCACATACACTAC 

y 

fOlA.aa BB001 

liffkdwlkrnkiwktlklfqitllfscsfysksnnteaiselqsspiklgkikvlqktekivstqnlqnlqqsq 
ffknekekiikkiaqefdeneklinkigpniemfaqtintdiqkiep^qfgi 

rl'fysslnydenkikklatilaqtsssndyhytliglifwtgfkiqeafesavniltkdeqkrlifnfrtktvkei 
qenfeklmqernswikivdniigeydkntggckadgkilgevirvgyeheldsnksmqilnnietplktccdhihy 

tOlA.aa^ BB001 ' 

CSFYSKSITOTEAISELQSSPIKLGKIKVLQKTEKIVSTQNLQNLQQSQFFKNEKEKIIKKIAQEFDENEKLINKIG 
PNIEMFAQTINTDIQKIEPNDQFGINKTLFTEKKDNNIDFMLKDNRLRRLFYSSLNYDENKIKKLATILAQTSSSN 
DYHYTLIGLIFWTGFKIQEAFESAVNILTKDEQKRLIFNFRTKTVKEIQENFEKLMQERNSWIKIVDNIIGEYDKN 
TGGCKADGKILGEVIRVGYEKELDSNKSMQILNNIETPLKTCCDHIHY 

f02A.nt BB002 

TAATTAATACTGGTTTTAATTTATAAGGAGAGTATTTTGAAAAAAGCCAAACTAAATATAATCAAGATTAATATTA 
TTACAATGATATTAACTTTAATTTGCATCTCATGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACAC 
CAAGCTTAAAAAAAACACCAGACTGAAAAAACCCGCCAATCCAGGGGAAAA 

GGAGACCTTGGCGCTTCTGATGAAAAATTTATGGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAG 

AAGATCGAAAAAATCAATACGATATACAAATAGCCAAAATTACTAATGAAGAATCTAACCTATTAGATACTTATAT 

TCGGGCTTATGAACTAGCTAACGAAAATGAAAAAATGCTTTTAAAAAGATTTCTTCTTTCATCTTTAGATTATAAA 

AAAGAAAACATAGAGACATTAAAAGAAATTCTTGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTG 

CAAATTTCCTTTATCGCATAGCGCTGGATATTCAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACT 

GGACACTCTAAGCAAAGAAAATTCAAAAGAAGATTTAGAGGCGTTGCTAGAACAAGTAAAATCTGCCTTACAGCTA 

CAAGAAAAGTTTAAAAAAACCCTAAACAAAACTCTTGAAGATTACCGTAAAAATACTAACAACATTCA^ 

AAGTACTAGCAGAACACTTTAATAAATATTACAAAGACTCTGATTCTTTACAATCTGCCTTTTATTAA 

t02A.nt BB002 . 

TGTGCACCTTTTAACAAAATCAATCCCAAGGCAAATGAAAACAC 

CCGCCAATCCAGGGGAAAACATCCAAAATTTTAAAGATAAATCTGGAGACCTTGGCGCTTCTGATGAAAAATTTAT 

GGGAACTACCGCTTCAGAGCTAAAAGCAATTGGTAAGGAGCTAGAAGATCGAAAAAATCAATACGATATACAAATA 

GC CAAAATTAC TAATG AAGAATCT AACC TATTAG AT ACTT ATATTCGGGCTT ATGAACT AGCTAAC GAAAATGAAA 

AAATGC TTTTAAAAAGATTTCTTC TTTC ATCTTTAGATT AT AAAAAAGAAAAC AT AG AGAC ATT AAAAGAAATTC T 

TGAAAAACTCATAAATAATTACGAAAACGACCCCAAAATTGCTGCAAATTTCCTTTATCGCATAGCGCTGGATATT 

CAATTAAAACTGGAAAAGCACTTAAAATCAATAAATGAAAAACTGGACACTCTAAGCAAAGAAAATTCA 

ATTT AGAGGC GTTGC TAGAAC AAGT AAAATCTGC C TT AC AGC T AC AAG AAAAGTTT AAAAAAAC CC T AAAC AAAAC 

TCTTGAAGATTAC CGT AAAAAT AC TAAC AAC ATTC AAGAAAAT AAAGTAC T AGC AGAAC AC TTTAATAAAT ATT AC 

AAAGACTCTGATTCTTTACAATCTGCCTTTTAT 



f02A.aa BB002 
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LILVLIYKESILKKAKLNIIKINIITMILTLICISCAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKDKSG 
DLGASDEKFMGTTASELKAIGKELEDRKNQYDIQIAKITNEESNLLDTYIRAYELANENEKMLLKRFLLSSLDYKK. 
ENIETLKEILEKLINNYENDPKIAANFLYRIALDIQLKLEKHLKSINEKLDTLSKENSKEDLEALLEQVKSALQLQ 
EKFKKTLNKTLEDYRKNTNNIQENKVLAEKFNKYYKDSDSLQSAFY 

t02A. aa BB002 

CAPFNKINPKANENTKLKKNTRLKKPANPGENIQNFKDKSGDLGASDEKFMGTTASELKAIGKELEDRKNQYDIQI 
AKITNEESNLLDTYIRAYELANENEKMLLKRFLLSSLDYKKENIETLKEILEKLINNYENDPKIAANFLYRIALDI 
QLKLEKHLKSINEKLDTLSKENSKEDLEALLEQVKSALQLQEKFKKTLNKTLEDYRKNTNNIQENKVLAEHFNKYY 
KDSDSLQSAFY 

f03A.nt BB006 

TGATTTAATGTAAATTTTAATTACCGCCTAAAAAAGGCTTTAAATGGTATAAAGGAAGAAGATCTAATGGTATTTA 
GAACATATAAACATTTGGAACTAATAATGCTGCCCATGTTAATGCTGAGTTGCGCTTTTTTTAAGAAACCACAATC 
TGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTACAT^ 

AAAAAATTGCCAATCATAAATAGTAATCATGACGTAACTTGGATAAAAACAAAGGCAATGACAATCTTAGGCGAAG 
ATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTC 

ATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATGGAGATGATGAATATGAAATTGAAGAT 

GTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTTAGCTGTTGAAAATTCACAAGAAGAAG 

GATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAAAATGCTTTTAAATTAACATATAAAAA 

TGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTACTCAAGAAACTAAAATTTATAAAATT 

TCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGAAAATTCTATATTAAAA 

GAGATTTATTTGAAGATATATAk 

t03A.nt BB006 

TGCGCTTTTTTTAAGAAACCACAATCTGTACATCAAGACAGCAATACTGGCAAACCAATAAGCGATGAAAAATTAC 
ATTTAATATCAGGCAAAATTTCAAATAAAAAATTGCCAATCATAAAT 

AAAGGCAATGACAATCTTAGGCGAAGATGGAAAAGAAATACCAGAATTTAAAAACAAATTTGGATATTCTTATATA 
ATATCTCCTGTAAAAATGGATGGAAAATATAGTTATTACGCGTCATTATTAATACTTTTTGAAACAACTAAAAATG 
GAGATGATGAATATGAAATTGAAGATGTTAAATTTGTAACAGCTGGTTCCACCCTAGAACTTAAAAATTCTCTTTT 
AGCTGTTGAAAATTCACAAGAAGAAGGATATGTTACTGCATACCCATTTGGAATATTGATGAGTGACGAGATTAAA 
AATGCTTTTAAATTAACATATAAAAATGGTCATTGGAATTATATGCTTGCAGATTTAACTGTCAAAAATAAACTTA 
CTCAAGAAACTAAAATTTATAAAATTTCTCTTAATTCAAAATTAATTATTGAATTTTTAAAAGAAGTGCTAAAAGA 
AAATTC TAT ATT AAAAGAC AT AGC TGGAG ATTT ATTTGAAG AT AT A 

f03A.aa B3006 

fnvnf1styrlkkalngikeedlmvfrtykh 

klpiinsnhdvtwiktkamtilge'dgkeipefknkfgysyiispvkmdgkysyyasllilfettkngddeyeiedv 

kfvtagstlelknsllavensqeegyvtaypfgilmsdei 

lnskl i i eflkevlkens i lkdi agdlfed i 

t03A.' aa BB006 • 

CAFFKKPQSVHQDSNTGKPISDEKLHLISGKISNKKLPIINSNHDVTWIKTKAMTILGEDGKErPEFKNKFGYSYI 
ISPVKMDGKYSYYASLLIIiFETTKNGDDEYEIEDVKFVTAGSTLELKNSLLAVENSQEEGYVTAYPFGILMSDEIK 
NAFKLT YKNGHWNYMLADLTVKNKLTQETKI YKI S LNSKL 1 1 E FLKEVLKENS I LKDI AGDLFEDI 

f04A.nt B3011 

TAATT AC C AAAGAT AAGT AAAC TTGC AAAT AAAAC T AC ACGTATTGAAAGT AG ATTTG AAATTTC G ATT AT ATTT A 
TATATAATGGCACTAAATATCTGAAAATGAAGGAGAAGCGGGTGGGCAATAAAATTTTTTATATTTCAGTGGTTTT 
AATTTTAATAGTTGGTTGCGACTGGGGAACT ATT AAAGAT AAAAGTACAGAAATTTCCAAGCTATTAAGAACGGAC 
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AAAGATAAGACTAAAAATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAAAATAATATGTCTACTA 
CTGATACGGGCATTACTAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGGGTCAGTGAACC 
ACCTATAATCTCAAATGAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATA 
ATAAACCCAAAACCAGCTCAAAATTTGGGAAATTCTTTAAACAATACTACTACTGAAGATAGTGTGAAGTTTTTA.T 
CAATTGAAAACCAAGAGTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAA 
AACACAACACGAAAAAGAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAA 
GAAATTATTAAGTTTAAGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTC 
AAAGGAATTCATTTATAAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTC 
ATCTATAGAGAAAGAAATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTTCAAATTGCAGATGTT 
AGCTGGAATAATGCAAACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACA 
ATGAGAGTAGAAAGCAAGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAA 
GGATGCAAAGTATAAGGCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCA 
AATGAAAAAGAAGCTAAAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTAGCCTATAA 

t0 4A.nt BB011 

TGCGACTGGGGAACTATTAAAGATAAAAGTACAGAAATTTCCAAGCTATTAAGAACGGACAAAGATAAGACTAAAA- 
ATCAAGATAGAATAGAATTGGGTGAAGATAATTTTGTATCTAAAAATAATATGTCTACTACTGATACGGGCATTAC 
TAGTTTAGGAAGTCTAAACAACTTGGATTTAATTAATCGTTCACAGCGGGTCAGTGAACCACGTATAATCTCAAAT 
GAGAAAGCCATAGCTACTCAAGCAAAAGTAGATTTAATGAACAACATTAATGTTACTATAATAAACCCAAAACCAG 
CTCAAAATTTGGGAAATTCTTTAAACAATACTACTACTGAAGATAGTGTGAAGTTTTTATCAATTGAAAACCAAGA 
GTGGCTTATTAGTAAAAAGATTTTGCCCAGTAAGTTGGAAAATTTAGAAAGCTTTCTAAAAACACAACACGAAAAA 
GAAGCTTTTAAGACGGCTAAAACTATACAAAGTCTCATTAGTAATTCCAATATGGGTAAAGAAATTATTAAGTTTA 
AGGAAGAATATTACAAACTTTATAATTTGTTTGAAGGCATACAACAAAAATTCCATAGTCAAAGGAATTCATTTAT 
AAAAGATACTAAATTTGGGGAAAATAGACAAAAAAATGCAGTTATATTTAAATCCTTTTCATCTATAGAGAAAGAA 
ATTAGAGATTTGAATTATAAGTTGNGTGAAATCCAAAGTAATTTTCAAATTGCAGATGTTAGCTGGAATAATGCAA 
ACTCTCTTTTAAAAGAATCTATAGAAAAATTAATTCAGGCAATTGAAAAAAGGTATGACAATGAGAGTAGAAAGCA 
AGGTCAAATTGGTGGACCTGCTAATAGATGGGATAAAAATCAAGCTGACAATTTTGCTAAGGATGCAi^GTATAAG 
GCAGAACATTCAGCAAATGATTTGGAAAATGCAGCCAACTATTTTAGATATAGTTGTTCAAATGAAAAAGAAGCTA 
AAAAGCTATTAGAAGAAATTAAAAAAAGATTTGTACGAATTGGTATTAGCCTA • 

f04A.aa BB011 

LPKISKLANKTTRIESRFEISIIFIYNGTKYLKMKEKRVGNKIFYISWLILIVGCDWGTIKDKSTEISKLLRTDK 
DKTKNQDRIELGEDNFVSKNINnytSTT 

NPKPAQNLGNSLNNTTTEDSVKFLSIENQEWLISKKILPSKLENLESFLKTQHEKEAFKTAKTIQSLISNSNMGKE 

IIKFKEEYYKLYNLFEGIQQKFHSQRNSF.IKDTKFGENRQKNAVIFKSFSSIEKEIRDLNY'KLXEIQSNFQIADVS 

WNNANSLLKESIEKLIQAIEKRYDNESRKQGQIGGPA^ 

EKEAKKLLEEIKKRFVRIGISL 

t04A.aa BB011 

CDWGTIKDKSTEISKLLRTDKDKTKNQDRIELGEDNFVSKNNMSTTDTGITSLGSLNNLDLINR5Q R VSEPPIISN 
EKAIATQAKVDLMirailWTIINPKPAQNLGNSLNNTTTEDSVKFLSIENQEW 

EAFKTAKTIQSLISNSNMGKEIIKFKEEYYKLYNLFEGIQQKFHSQRNSFIKDTKFGENRQKNAVIFKSFSSIEKE 

IRDLNYKLXEIQSNFQIADVSWNNANSLLKESIEKL^ 

AEHSANDLENAANYFRYSCSNEKEAKKLLEEIKKRFVRIGISL 

fOSA.nt 3B009 

T AAAT AAATTGTAGG ATAAAAATGAAAC AAAAATACG AAAAC T ATTTT AAA-AAAAGATTAATTTT AAAC C T ATT AA 
TATTTTTACTACTAGCATGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATA 
CAATATTTTGGGCAGTTCAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTACATTGCAGCCATGCATTTA 
TTTAAAAAAGAAAAOGGCAAGATTGAAAAAATTGATTTGAGCAATTCT^ 

TATCTGGAAAAACCTATCTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTG 
GACATTAAAATTTAAAAAACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCGTAA 



213 

TABLE 1. Nucleotide and Amino Acid Sequences 



tOSA.nt BB009 

TGCTCAAGCGAATCCATATTTTCACAATTAGGAAATCTGCAAAAAATAAAACATGAATACAATATTTTGGGCAGTT 
CAAGTCCAAGAGGAATTTCTCTAGTAGGAGAAACTCTCTA.CATTGCAGCCATGCATTTATTTAAAAAAGAAAACGG 
CAAGATTGAAAAAATTGATTTGAGCAATTCTTATGAGTTTATAAACGACATTGTAAATATATCTGGAAAAACCTAT 
CTTTTAGCGCAAAACAAAGAAGAAGAATTAGAAGTTTGCGAGCTAAATGGAAAAGATTGGACATTAAAATTTAAAA 
AACCGCTAAAAGCATATAAATTCTTAAAATCCGTAGAAGAGATGGCG 

fOSA.aa BB009 - 

INCRIKMKQKYENYFKKRLILNLLIFLLLACSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLF 
KKENGKIEKIDLSNSYEFINDIVNISGKTYLLAQNKEEELEVCELNGKDWTLKFKKPLt^AYKFLKSVEEMA 

tOSA.aa BB009 

CSSESIFSQLGNLQKIKHEYNILGSSSPRGISLVGETLYIAAMHLFKKENGKIEKIDL'SNSYEFINDIVNISGKTY 
LLAQNKEEELEVCELNGKDWTLKFKKPLKAYKFLKSVEEMA 

fOSA.nt BB014 * 

TAAGGAGCATATATGAGGATTTTGGTTGGCGTTTGTATAATAGCATTGGCTTTATTGGGTTGTTATTTGCCTGATA 
ATCAGGAACAAGCTGTTCAAACTTTTTTTGAGAATTCGGAAAGTAGTGATATGGGTTCCGATGAGATTGTTACTGA 
AGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTTATTGGTTGAGATAAAAAAGACTTTAATTAGT 
TTAAAAGATCCTAATTATCNKGNTGTAGTACNCCCAGTGAGTGACTATAATGAGGAGTATTTTAATAAATTCTTTC 
TAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTTTATTATGGTAAAAAATGAGCAGAACAATAATAA 
ATTTATGCGTATAGTTCGTTGGCTGTATTCATGTATAGAGGAGTTATATTCTCTAGATATTAAGTATTCTGGCGAG 
GGGAGCCATGAGTATAATCGTAATATGCCTAGACCCACTGCTTATGAACAATATTTAAAAGTGAAGAGGTATGATT 
ATAATAGCCCAGTTTCTATTTTACCTACATAA 

t06A.nt BB014 

TGTTATTTGCCTGATAATCAGGAACAAGCTGTTCAAACTTTTTTTGAGAATTCGGAAAGTAGTGATATGGGTTCCG 
ATGAGATTGTTACTGAAGGCATATTTTCTAGTTTAAAATTATATGCGTCTGAACATCGTTTATTGGTTGAGATAAA 
AAAGACTTTAATTAGTTTAAAAGATCCTAATTATCNNGNTGTAGTACNCCCAGTGAGTGACTATAATGAGGAGTAT 
TTTAATAAATTCTTTCTAGATTTAGGGTCTGAGCAATCTAAAGACCTGATTAAGTTGTTTATTATGGTAAAAAATG 
AGCAGAACAATAATAAATTTATGCGTATAGTTCGTTGGCTGTATTCATGTATAGAGGAGTTATATTCTCTAGATAT 
TAAGTATTCTGGCGAGGGGAGCCATGAGTATAATCGTAATATGCCTAGACCCACTGCTTATGAACAATATTTAAAA 
GTGAAGAGGTATGATTATAAT 

f06A.aa BB014 

GAYMRILVGVCIIALALLGCYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSLKLYASEHRLLVEIKKTLISL 
BOJPNYXXVVXPVSDYNEEYFNKFFLDLGSEQSKDLXKLFIMVKNEQNNNKFMRIVRWLYSCIEELYSLDIKYSGEG 
SHE YNRNMPRPTAYEQ YLKVKRYD YNS P VS I L PT 

t06A.aa BB014 . 

CYLPDNQEQAVQTFFENSESSDMGSDEIVTEGIFSSI J KLYASEHRLLVEIKKTLISLXDP^^YXXVVX?VSDYNEEY 

FNKFFLDLGSEQSKDLIKLFIMVTCNEQNNN^ 

VKRYDYN 

f07A.nt BB023 

TAAAGTATTTTATTTTTTTTATTATCCACTGTTCTTTTTGCTCAAGAGACTGATGGATTAGCAGAGGGTTCTAAAA 
GGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATC.CAAGTTCAACTAGACTTGATCTTAC 
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AAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGTTGTAGATCTTGGGATA 
AATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCAGTTGTTGCGCCCGCTG 
TTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTCCAAGCTATTCTCA 
ATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCAATTTTTAGGCAAA 
GGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTATGAGATAGATCTTG 
AGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTTNNTCTATGGGTACTTTAAAGTTTAAAGGGTGGGCTGA 
TTTAATTTGGTCAAATCCTAACTATATTCCTAATATATCATCCAGAATTATTAAAGACGATGTTCCAAATTATCCT 
CTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTAGAGTTTCAAAGTCACACAGTTCAAAAGAGCAAAATTTCATCT 
TTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGTTTCAATAGATTCTGATATTGACAGTGAGTCTGT 
ATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAGGCACACGNAACNTTTAAAAGNGTTTTA 
AAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTCTTTCCAAAACTTTGTAGAAAAGATTGAGAGTGAAAAACCTG 
AAGAATCATCTCCGAAAAATTAG 

t07A.nt BB023 v * ■ . . " 

GAGGGTTCTAAAAGGGCAGAGCCTGGAGAATTAGTTTTAGATTTTGCCGAGCTTGCAAGAGATCCAAGTTCAACTA 

GACTTGATCTTACAAATTATGTTGATTATGTATATTCGGGCGCTTCTGGTATTGTTAAGCCGGAAGATATGGTTGT 

AGATCTTGGGATAAATAATTGGAGCGTTTTACTTACTCCTTCTGCAAGGTTGCAGGCTTACGTTAAAAATTCAGTT 

GTTGCGCCCGCTGTTGTTAAGAGTGAGTCAAAAAGGTACGCAGGTGATACTATTTTAGGGGTAAGAGTTTTGTTTC 

CAAGCTATTCTCAATCATCTGCTATGATTATGCCACCATTTAAAATTCCTTTTTATTCAGGGGAAAGTGGCAATCA 

ATTTTTAGGCAAAGGTCTTATTGATAACATTAAAACCATGAAAGAAATTAAGGTATCTGTTTATAGTTTAGGGTAT " 

GAGATAGATCTTGAGGTTTTATTTGAAGATATGAATGNCATGGAATATGCTTNNTCTATGGGTACTTTAAAGTTTA 

AAGGGTGGGCTGATTTAATTTGGTCAAATCCTAACTATATTCCTAATATATCATCCAGAATTATTAAAGACGATGT 

TCCAAATTATCCTCTTGCTTCAAGTAAAATGAGATTTAAGGCTTTTAGAGTTTCAAAGTCACACAGTTCAAAAGAG 

CAAAATTTCATCTTTTATGTTAAAGATTTAAGAGTTCTTTATGATAAGTTGAGTGTTTCAATAGATTCTGATATTG 

ACAGTGAGTCTGTATTTAAAGTTTATGAGACTAGCGGAACTGAATCCCTTCGTAAATTAAAGGCACACGNAACNTT 

TAAAAGNGTTTTAAAGCTTAGAGAAAAAATTTCTATGCCTGAAGGCTCTTTCCAAAACTTTGTAGAAAAGATTGAG 

AGTGAAAAACCTGAAGAATCATCTCCGAAAAAT 

f07A.aa BB023 

SILFFLLSTVLFAQETDGLAEGSKRAEPGELVLDFAELARDPSST 

NWSVLLTPSARLQAWKNSWAPAWKSESKRYAGDTILGVRVLFPSYSQSSAMIMPPFKIPFYSGESGNQFLG 
LIDNIKTMKEIKVSVYSLGYEIDLEVLFEDMNXMEYAXSMGTLKFKGWADLIWSNPNYIPNISSRIIKDDVPNYPL 
ASSKMRFKAFRVSKSHSSKEQNFIFWKDLRVLYDKLSVSIDSDIDSESVFKVYETSGTESLRKLKAHXTFKXVLK , 
. LREKISMPEGSFQNFVEKIESEKPEESSPKN 

t07A.aa BB023 

EGSKRAEPGELVLDFAELARDPSSTRLDLTNYVDYWSGASGIVKPEDMV^ 

VAPAWKSESKRYAGDTILGVRVLFPSYSQSSAMIMPPFKIPFYSGE3GNQFLGKGLIDNIKTMKEIKVSVYSLGY 
EIDLEVLFEDMNXMEYAXSMGTLKFKGWADLIWSNPN^ 

QNFIFYV1CDLRVLYDKLSVSIDSDIDSESVFKVYETSGTESLRKLKAHXTFKXVLKLREKISMPEGSFQNFVEKIE 
SEKPEES'SPKN - . 

f08A.nt BB024 

TGAATATTAATAATAAAAAAAGGAGTAACAATGAAAATCATCAACATATTATTTTGTTTATTTTTACTAATGCTAA 
ACGGCTGTAATTCTAATGATAATGACA.CTTTAAAAAACAATGCCCAACAAACAAAAAGACGGGGAAAGCGTGATTT 
AAC CC AAAAAGAAAC AAC AC AAG AAAAACC AAAATC T AAAGAAG AAC T ACTT AGAGAAAAGC T ATC TGAC GATC AA 
AAAACACATCTTGACTGGTTAAAACCCGCTTTAACTGGTGCTGGAGAATTTGACAAATTCTTAGAAAATGATGATG 
ATAAAATAAAATCAGCACTTGATCATATAAAAACTCAACTTGATAGT^ 

" AACCACTTTCAAAACTGTGGTTACAGAATTCTTTAAAAATGGTGATATAGATAATTTTGCAACTGGAGCGGTTAGT 
AACTGCAATAATGGTGGCTAA 



t08A.nt BB024 



215 

TABLE 1 . Nucleotide and Amino Acid Sequences 



TGTAATTC T AATGATAATGAC AC TTT AAAAAAC AATGCCCAACAAACAAAAAGACGGGGAAAGCGTGATTTAACCC 

AAJW^GAAACAACACAAGAAAAACCAAAATCTAAAGAAGAACTACTTAGAGAAAAGCTATCTGA 

ACATCTTGACTGGTTAAAACCCGCTTTAACTGGTGCTGGAGAATTTGACAAATTCTTAGAAAATGATGATGATAAA 

ATAAi^TCAGCACTTGATCATATAAAAACTCAACTTGATAGTTGTAATGGTGATCAAGCAGAACAACAAAAAACCA 

CTTTGAAAACTGTGGTTACAGAATTCTTTAAAAATGGTGATATAGATAATTTTGCAACTGGAGCGGTTAGTAACTG 

CAATAATGGTGGC 

f08A.aa BB024 

ILIIKKGVTMKIINILFCLFLLMLNGCNSNDNDTLKNNAQQTKRRGKRDLTQKETTQEKPKSKEELLREKLSDDQK 
THLDWLKPALTGAGEFDKFLENDDDKIKSALDHIKTQLDSCNGDQAEQQKTTFKTVVTEFFKNGDIDNFATGAVSN 
CNNGG 

t08A.aa BB024 

CNSNDOTTLKNNAQQTKRRGKRDL51TQKETTQEKPKSKEELLREKLSDDQKTHLDWLKPALTGAGEFDKFLENDD 
DKIKSALDHIKTQLDSCNGDQAEQQKTTFKTWTEFFKNGDIDMFATGAVSNCNNGG 

f 09A.nt BB025 ' 

TGAATATTAATAATAAAAAAAGGAATAATAATGAAAATTATCAACATATTATTTTGTTTATTTTTACTAATGCTAA 
ACGGCTGTAATTCTAATGATACTAATAATAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGC 
AAC AC AAGAAAAAC C T AAATC TAAAG AAGAAC TTC T TAG AG AAAAGC TAAATG AT AATC AAAAAAC AC AC C TTG AC 
TGGTTAAAAGAAGCTCTGGGCAATGATGGAGAATTTAATAAATTTTTAGGATATGATGAAAGCAAAATAAAATCTG 
CACTTGATCATATAAAGAGTGAACTTGACAGTTGTACTGGAGATAAGGTTGAAAATAAAAATACCTTCAAGCAGGT 
CGTTCAGGAGGCCCTTAAAGGGGGCATAGACGGCTTTGAAAATACTGCAAGTAGTACGTGCAAAAATTCATAA 

t09A.nt BB025 

TGTAATTCTAATGATACTAATAATAGCCAAACAAAAAGTAGACAAAAACGTGATTTAACCCAAAAAGAAGCAACAC 
AAGAAAAAC C T AAATC T AAAGAAGAACTTCTT AGAGAAAAGC TAAATGAT AATC AAAAAAC AC AC C TTGACTGGTT 
AAAAGAAGCTCTGGGCAATGATGGAGAATTTAATAAATTTTTAGGATATGATGAAAGCAAAATAAAATCTGCACTT 
GATCATATAAAGAGTGAACTTGACAGTTGTACTGGAGATAAGGTTGAAAATAAAAATACCTTCAAGCAGGTCGTTC 
AGGAGGCCCTTAAAGGGGGCATAGACGGCTTTGAAAATACTGCAAGTAGTACGTGCAAAAATTCA 

f09A.aa BB025 - 

ILIIKKGIIMKIINILFCLFLLMLNGCNSNDTNNSQTKSRQKRDLTQKEATQEKPKSKEELLREKLNDNQKTHLDW 
LKEALGNDGEFNKFLGYDESXIKSALDHIKSELDSCTGDKVENKNTFKQWQEALKGGIDGFENTASSTCKNS 

t09A.aa BB025 

CNSNDTNNSQTKSRQKRDLTQKEA5 1TQEKPKSKEELLREKLNDNQKTHLDWLKEALGNDGEFNKFLGYDESKIKS 
ALDHIKSELDSCTGDKVENKNTFKQWQEALKGGIDGFENTASSTCKNS 
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Na+/H+ antiporter [Bacillus firmus] >pirlA41594IA41594 


Na+/H+ antiporter (nhaC) [Haemophilus influenzae] 
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hypothetical protein [Bacillus subtilis] >gnllPIDIel 182969 


(AEOOl 164) conserved hypothetical protein [Borrelia burgdorferi] 
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(AE000788) B. burgdorferi predicted coding region BBK47 [Borrelia 


(AE000788) B. burgdorferi predicted coding region BBK49 [Borrelia 


(AE000788) immunogenic protein P37, putative [Borrelia burgdorferi] 


(AE000788) immunogenic protein P37 [Borrelia burgdorferi] 


immunogenic protein P37 [Borrelia burgdorferi] 


(AE000788) immunogenic protein P37, putative [BoiTelia burgdorferi] 


(AEOO 11 80) B. burgdorferi predicted coding region BB0826 [Borrelia 


(AEOOl 162) serine-type D-Ala-D-Ala carboxypeptidase (dacA) 
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Probable penicillin-binding protein. [Escherichia coli] 
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Ipenicillin-binding protein 5 [Bacillus subtilis] 
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(AE000787) B. burgdorferi predicted coding region BBJ47 [Borrelia 


(AE000785) B. burgdorferi predicted coding region BBE16 [Borrelia 


(AEOOl 180) B. burgdorferi predicted coding region BB0823 [Borrelia 


(AEOOl 180) B. burgdorferi predicted coding region BB0823 [Borrelia 


(AEOOl 1 59) carboxypeptidase, putative [BoiTelia burgdorferi] 


(AF006665) YokZ [Bacillus subtilis] 


(AFO 15775) carboxypeptidase [Bacillus subtilis] >gnllPIDIe 11 85433 


D,D-carboxypeptidase [Enterococcus faecalis] 
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(AEOOl 159) CTP synthase (pyrG) [Borrelia burgdorferi] 


CTP synthase (pyrG) [Methanococcus jannaschii] >pirlE64446IE64446 


(AE001088) CTP synthase (pyrG) [Archaeoglobus fulgidus] 


CTP synthetase [Synechococcus PCC7942] >splQ54775IPYRG SYNP7 
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CTP synthetase [Synechocystis sp.] >pirlS75840IS75840 CTP 


CTP synthetase [Bacillus subtilis] >gil853762 CTP synthase [Bacillus 


(AE000735) CTP synthetase [Aquifex aeolicus] 


CTP synthetase (pyrG) [Haemophilus influenzae] >pirlF64 181 IF64 181 
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KAE000826) CTP synthase [Methanobacterium thermoautotrophicum] 


CTP synthase [Mycoplasma capricolum] >pirlS77767IS77767 CTP 
synthase 


|cytidine triphosphate synthetase precursor [Giardia intestinaUs] 


CTP synthetase (EC 6.3.4.2) [Escherichia coli] 


CTP synthetase [Escherichia coli] >gi!1789142 (AE000361) CTP 


CTP synthase [Azospirillum brasilense] >pirlI39496IS25101 CTP 


|(AE001 158) B. burgdorferi predicted coding region BB0563 [Borrelia 


phosphotransferase enzyme II [Borrelia burgdorferi] >gi!2688474 
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(AF046999) 2.9-9 lipoprotein [Borrelia burgdorferi] 


surface-exposed lipoprotein [Borrelia garinii] 


lipoprotein. [Borrelia burgdorferi] 


(AF000270) lipoprotein [Borrelia burgdorferi] >gil3095109 


(AF046998) 2.9-8 lipoprotein [Borrelia burgdorferi] 


lipoprotein [Borrelia burgdorferi] 


(AE001 149) fructose-bisphosphate aldolase (fba) [Borrelia 


fructose 1,6-bisphosphate aldolase [Escherichia coli] >gil41423 


(AF037440) fructose 1,6-bisphosphate aldolase [Edwardsiella 


fructose-bisphosphate aldolase (fba) [Haemophilus influenzae] 


fructose 1,6-bisphosphate aldolase [Campylobacter jejuni] 


fructose 1,6-bisphosphate aldolase [Schizosaccharomyces 


yeast fructose-bisphate-aldolase [Saccharomyces cerevisiae] >gil3696 


fructose- 1,6-bisphosphate aldolase [Euglena gracilis] 


fructose 1,6 bisphosphate-aldolase [Neurospora crassa] 


fructose-bisphosphate aldolase [Corynebacterium glutamicum] 


Fba [Mycobacterium tuberculosis] , 


fructose-bisphosphate aldolase [Mycoplasma genitalium] 


hypothetical protein [Bacillus subtilis] >gnllPE)lel 184692 


|(AE001 148) DNA gyrase, subunit A (gyrA) [Borrelia burgdorferi] 


DNA gyrase subunit A [Clostridium acetobutylicum] 


|(AE001072) DNA gyrase, subunit A (gyrA) [Archaeoglobus fulgidus]. 


ORF 821 (aa 1-821) [Bacillus subtilis] >gnllPIDId 1005785 A subunit of 


gyrase A subunit [Pseudomonas aeruginosa] >splP48372IGYRA PSEAE 
DNA 


DNA gyrase A [Campylobacter jejuni] >pirlA48902IA48902 DNA gyrase 


8244 


gil3095107 


gnllPIDIe26 
8242 


gil 1209837 1 


o 

CO 
CN 

CN 

CN 

*s> 


gil3095105 | 


gil 1209873 | 


o 

»o 

CO 
CO 

oo 

CN 

'Eh 


VO 

CN 

oo 
oo 

'Eh 


VO 
VO 
CO 

o 

CN 

'Eh 


gil 1573507 


5- 

oo 

r- 
vo 

Eh 


gnllPIDldlO 
04756 


gil433637 


gnllPIDIel9 
10134 


gil 1334980 


vo 
O 

'Eh 


g nllPrDle31 
5480 


gil 1045692 


gnllPIDldlO 
03809 


gil2688338 


Igil 1790876 


CO 

vo 

O 

VO 
VO 
CN 

'Eh 


|gil40019 


as 

CN 
G\ 
ON 
VO 

'Eh 


vo 

o 

CN 

'Eh 




co 

cd 
CN 


f47-2.aa . 


cd 

CO 

CN 
i 

r- 

a 


CO 
CO 

CN 
t 


cO 
CO 

CN 

S 


CO 
cO 

CN 
i 

a 


CO 


CO 

CO 

a 


CO 
CO 


cO 

CO 


CO 


CO 
cO 

r- 
S 


cd 

CO 

a 


CO 

a 


cO 

CO 

r- 

a 


cO 

CO 

a 


f477.aa 


CO 
CO 

r- 

a 


CO" 
CO 

a * 


CO 
CO 

od 

OO 

a 


cO 
cO 

od 
oo 


CO 

CO 

CO 
CO 

a 


CO 

CO 

od 
oo 

a 


.CO 
CO 

od 
oo 


cO 
CO 

od 
oo 

a 



233 



cn 
to 



o 
vo 

co 



vo 

CN 
CO 



O 
O 
cn 



to 
on 
m 



<2 

S 
r- 
r- 

o • 

a 

Cu 

CO 

A 



CO 

JO 



o 

Cd 

42 

a 

u 



COQ 



CO 

o 
"3 

a 

-O 



o 

C3 

o 
o 



o 

vb 



in 

CN 



z 

Q. 

p- ' 

vo 

o 

co 

c/3 

r- 
to 
o 

on 
CO 

cu 

A 
— -t 
X 

cd 

o 

cd 

as 



c 

3 

43 
3 



O 
VO 

I 

o 

CN 



CN 
Xf 
VO 



on t-, 
cd <D 

^ E 
o 

-S3 

2 o4 
P 2 



o 

Cd 
O 



CO 
< 

Q 



CO 

o 
o 

o 

< 



o 

VO 
I 

o 

ON 



CO 
CO 
CO 



Cd 
P 

5b 
S 

CO 

6 

3 

c 

O 
O 
>% 

s 

3 ■ 
42 
3 

on 

<L> 

GO 

cd 

GO 
< 

O 



r- 

W 
o 
co 



CO 

co 

CO 



co 



CO 



o 

cd 
X> 
O 

o 



CO 

c 



4= 

I 

cu 



c 

o 

CO 

3 
Cu 
o 

< 

Q 



w 

o 

CN 
VO 



lO 



o 

CN 

w 
o 

CO 

cn 



xf 

CO 



co 



VO 

CO 



COD 

A 



O 
o 
cd 

43 

O 

n 

<u 
n 
o 

co 

a 

»o 
r- 
co 

t 

< 
< 

o 

3 
O 

5-1 ! 

<o 

co 



t-i 

O 
CP 



cd 

e 

CO 

J2 
15- 
o 
o 



GO 



< 



o 

til 
o 
p 

CO 

co" 

VO 

xr 



o 
o 

W 
o 

CO 



CN 
0\ 
i 

m 

O 
VO 

xf 



O 

GO, 

3 

4=) 
Cd 



Cd 
O 

c 
o 

cd 
C 

1 

o 



■S 

o 
-a 

GO 
i— 
3 

cd 

1 

O 

a 

w 

CO 

> 

O 

S-t 
CL 

4J 
CJ 
3 

3 

er 

CO 

J3 



o 

GO 



o 

co 

> 

53 
p 

O 



3 

er 

CO 



CO 
CO 
I 

W 
O 

CO 



CN 
VO 
CO 



o 

vo 

VO 



o 
-o 

GO 

3 
4D 

C3 



O 

CN 

co 

> 



3' 

GO^r- 
3 

.2 

b 
o 



o 

GO 

in 



> 



d 
o 

c 
o 

c 

s 

o 
o 
?-> 

co 



o 

& 

co 



o 

a 

co 
> 



cr 

CO 



3 
CU 



P 



CN 
CO 

o 

CN 
VO 



VO 

i 

W 
o 

CN 
CO 



o 

vo 



o 



vo 

CO 
CN 
i 

o 

VO 



VO 
CO 
CO 



o 

GO, 
3 



O 
3 

.2 
c 

1 

o 
o 



O 

-a 

GO 

3 
43 

1 

O 

CO 

> 

C 

-I— » 

o 

Cu 

'<U 
O 
G 
(U 
3 

cr 

<u 

CO 

O 



O 
GO 
3 



Cu cu : 



3 
Cul 



o 

CO 

> 

G 

O 
I-* 
Cu 

o 

G 

<L> 
3 

cr 

<u 

CO 

M 

I 

Q. 

s 



•a 

O 

-o 

GO 
*— . 
3 
,0 

1 

O 

CO 

> 
s 

o 

Cu 
o 



o 

GO 

u- 
3 



O 

-a 

GO 



4^ 

cd 



O 

a 

CO 

> 

C 

o 

Cu 

4J 

O 



CT 



O 

a 

CO 

> 

G 
£ 

o 

Cu 

(U 

o 
c 

<u 

3 

CP) \ 

CO 



O 
TD 
GO 

3 
-O 

cd 
• ^ 

c 
o 

a 



o 

GO 
cd 



O 

a 

CO 



u* 
O 

'a 

CO 
CN 

o 

G 
O 

*G0 
(O 

GO 
C 

-3 
o 
o 

T3 
O 



cd 
o 



o 

GO 



O 



o 

l G 
(U 
3 

crj 

CO 



cd 
C 

s 

o 

o 



o 

GO 
3 

PQ 
co 



Cu 

E 



E — 



o 
-a 

GO 

t_ 

3 

cd 



O 

a 



(XI 



vo 

CN 
vo 
■ vo 

'5b 



VO 
CN 



= CN 

c as 

G0VO 



vo 
CO 

co 
"go 



CO 

G VO 
GOCN 



CN 

e 

c o 
Go*^r 



CO 



3j 

Q- 

Cu CO 
=; co 
c O 
coi cor- 



CO 

vo 



GO 



GO 



CN 
CO 
CN 
CA 
CO 

O 
CI 

5b 



CN 

r- 
vo 

CN 

5b 



CN 
CO 
G\ 
CO 

O 

CN 

5b 



CO 
CO 
CN 
CA 
CO 

o 

CN 



CO 

o 

CO 

o\ 

CO 

o 

CN 



GO GO 'GO 



CO 
CO 
CN 
ON 
CO 

o 

CN 

5b 



CN 
CO 
CO 
CN 
CO 

O 
CN 



GO GO 



VO 
CO 
CO 
CN 
CO 

o 

CN 

5b 



co co 



vo 

CN 

r- 
co 

CO 
CN 
GO 



vo 
xi- 
co 

CO 
CO 

vo 
CN 



CO CO CO 



co 

VO 
CN 

CN 
VO 
CN 

GO 



cd 
cd 

CO 
OO 



cd 
cd 

CO 
CO 
XT 



CO 
CO 



CO 
CO 



cd 
cd 

CO 
CO 



cd 
cd 

CO 
CO 

,xr 



cd 
cd 

cd 
co 
xf 



cd 
cd 

CO 
CO 



CN 



cd 

cd 

CN 
i 

CN 



cd 
CN 



CN 
i 

CN 



cd 
,-d 

i. 

CN 
XT 



cd 
cd 

CN 
i 

Cn 
.Ti- 



ed 
cd 

CN 
t 

CN 
Xt 



cd 
cd 

CN 

) 

CN 



CN 

CN 
Xt 



cd 
cd 

CN 
i 

CN 



cd 
cd 

oi 

CN 



cd. 
cd 

CN 

t 

CN 



cd 
cd 

xr 

ON 



cd 
cd 



234 



CO 

o 

XT 



co 
vo 



cn 

CO 

W 
o 
vq 

cn 



o 

toO 



.O 
03 



O 

PQ 



o 

a 

EH 
w 

co 
co 

CO 

r- 
co 



o 
-a 
too 

03 



OX) 

A 



o 

.2 
o 



Q . 



o 
-o 

03 



O 

PQ 

s 

o 

1_ 

X 
Bl 

w 

vo 

so 
o 

CN 

o 



co 
r- 

CO 



ao 



03 

<3 



HI 
O 

CN 

r- 

CN 
VO 
CN 

co 



£1< 



CO 



CO 
CN 

CN 

ab 



O 

CN 

t 

o 

uo 



CO 
CN 
CN 



.o 
g 

G 

c3 
JG 
O 

-a 

I 

o 

"3 

c 

o 

o 
o 
<o 



CN 
CO 

e 

Cu vo 

G CO 
too On 



CO 
I 

pa 

o 

CO 



CO 

o 

CN 



CD 



o 

03 

O 
too 



o 

JG 
O-, 
en 
O 

-G 

a. 



cr 



toO 



Q CO 

— CO 
C CN 
toO— . 



03 

o3 
i 

uo 



vo 

W 
o 

CN 



CN 
On 



vo 
i 

W 
o 

VO 



vo 



o 
o 



o 

o3^ 
CO 

r- 

& 

CJ 

"S 

cM 

<lT 

xl 
JH 
a. 

£ 
o 
o 

CD 
GO 



o 

CU 

03^ 

too 
o 

o 

s 

o 

CO 



o 

CJ 

s 

on 
O 



o 

a 

VO 
>— f 
PQ 
PQ 

G 

O 

'toO 
toO 



O 

a 

CO 



J*! 



o 

toO 



T3 

o3 

■e 

o 
c 

<D 
o3 



03 ^ 



PQ 

on 1 

CO 

r- 
o 
o 
o 
PJ 
< 



o 

CO 

vo 
o 
o 

O- 



a 

o 

CO 
CN 



VO 



CD 
too 

c 

o3 

<D 
O 



4> 
O 

o 



CD 
G 



V3 
G 

toO 



03 



CN 
CO 

v-i 
vo 

CN 
CN 

cb 



toO 
i 



CO 
CO 
lO 
CO 
VO 

o 

CO 

too 



o 
o 

o 

VO 
CN 

tob 



O 

a\ 

CO 

uo 

CN 
CO 



toO 

cd 
cd 

i 



CO 

o 

CO 

too 



03 
OJ 

o 

< 

CN 
O 

BL 

co 

Q 



CN 

w 

o 

CO 
CO 



=: co 
c O 

toO ON 



03 
C3 

i 

yo 



03 
03 

ViO 



CO 

o 

O 

CO 
CO* 

vo" 

to 

CO 



o 

CO 



CO 

co 



too 
A 



*C 
o 
-a 

toO 
3 

03 



O 

PQ 

a 



o 

O 

C3 



CN 

> 

o 
o 



o 

toO 
3 



o 
PQ 



o 

& 



o 

CN 



UO 
O 



too 

03 

. 03 



vo 

CN 

r- 

CN 

vo 

CN 

tob 



03 
03 

iri 
t 



CN 

o 

CO 



CO 

vep 

o 

CO 

vd 



CN 
CN 
CO 



CN 
VO 

pi 
o 

ON 



o 

ON 
CN 



CN 
O 



o 



CN 

^ 

CO 
VO 
O 
r— 
cn 

CO 

vo 
o 
r- 
co 

'S-l 

A 



2 

a 

03 



CD 



CO 
CN 
O 



co 

, CN 

o 



CN 
CO 

vo 
o 

CN 
CO 
VO 

o 
r- 

cn 

*5- 

A 



O 

-a 

too 



O 
T3 
toO 

3 



o 

• -a 

toO 



<D 

b 
o 
PQ 



o 
-a 
too; 

3 

o3 



O 

toO 
w 
3 

C3 



O 

-a 

toO 

3 
-O 



p 

CD 



g 

o 

PQ 



<D 

G 

O 

a 
pu 

ex. : 



CD 
G 
C3 

1 

<D 



O 

a 

o 

(D 
O 

3 

CD 

-4— » 

3 

O 



CD 

2 

Cu 

.<D 

3 

co 

CD 



O 
toO 

03 



B 



CN 



CN 

co 

vo 

o 
co 

co 
vo 
o 
r^- 



r 13 
5—. 

o 
-a 

toO 



o3 



3 
O 

CD 
, 

o 

CN 



co 

CN 

PJ 

O' 



PJ 

o 



vo 

vo 
CN 



CN 

CO 
CO 
lO 

o 

¥2 

CO 
CO 

to 
o 
r- 
cn 

Cl 
A 



O 

Cl 

W 
co 



CO 

r- 

CO 



too 
A 



O 
toO 



cD 
i 

Uh 

O 

-a 

toO 

!— < 

3 

03 



^=1 
03 



O 

a 

vo 

o 

CO 

O 

PQ 



T3 
O 

o 

0> 



<D 



B-S 

^ PQ 



o 

CD 

o 

CN 

PQ 



o 
PQ 

a 

c 

p 
a. 

CD 
O 
03 
-t— i 

3 
oo 

CD 



O 

toO 
u 
3 



CD 
G 
O 

PQ 



o 
-a 

too 
3 

PQ 



1 o 
o 



CO 
CN 

o 



a0 

o3 
o3 

IO 

i 

VO 



too co 



too 



3 

vo 



o 

CA 
CN 

o 



toO 



toO 



CO 
CO 

O 
vo 

CN 

co 
tobl 



o 
o 

VO 

ON 
CO 

tob 



o 

CN 



CO 

co 



VO 

co 
co 
vo 

CN 



03 
03 

VO* 

! 

VO 



o3 
C3 



VO 
i 

VO 



vo 
vo 



o3 
03 



VO 



03 
C3 

VO 

i 

}0 



too 

03 
03 

VO 

I 

VO 



ao 



03 
o3 

VO* 



o 

VO 



235 



5 

m 

Ul" 

d 

ill 
■ . i 

b 
ry 
□ 



00 

<u 

co 
S3 

03 

» 

03 



Q 

T3 
C 

03 

C 

03 

CQ 

c. 
<u 

O 

c 



O 

c 
cr 

co 

-a 
c 

03 



C 

> 
.5 

c 

PL, 



Oh 

o 
cu 

<u 
x: 



-O 

CO 

<Q 
O 
* C~ 
<D 

cr 

<o 

cn 
CO 

c 



CO 

-2 

O* 

CN 



■a 



'3 



0 



CO 



O0 



GO 



CO 



CO 

o 

03 
CO 

o3 
C 
O 

s 

o 

a 

<u 

co 



a 
5 
o 

i-t 
CL 

o 
03 

=3 
CO 



CO 

pq 
oo 

I 

js 

PJ 

r- 

CN 

o 

CO 

CO 

A 
aJ 

03 
CO 

c 













ON 


oo 


CN 


CN 


co 


r- 


CO 


CN 


on 




VD 


CO 


to 








'3b 


*co 


'cb 



ON 

i 

W 
o 

CO 



oo 

CN 



a 

3 
o 
a, 

O 

iH 

CO 



o3 
CO 

c 



o 

0) 



c 
3 
o 

c? 
o 

o3 



CO 



c 

<U 

c 
o 
a* 

s 

o 
u 

I 

o 



ON 
i 

W 
o 

CN 



CN 
UO 
CN 



CO 
, CO 

2-2 

CuPQ 



o3 
o3 

CN 
O 
UO 



Cu 



CO 

>s 

O 
O 
JG 

o 
o 
c 

CO 
03 

c 
g 



g 
o 

rO 

CO 

C 

03 



O 

CO 

C 



ON 

W 
o 

CN 

vd 



CN 
CN 



6 to 

CU CN 

c oo 

CO— • 



o3 
o3 

CN 
O 
liO 



ON 

T— i 

ON 
CO 

r- 



oo 
A 



o 

a 

<u 
,JG 
o 

CO 

c 
'3 
o 

u-. 

o • 

■4—) 

Z2 
CO 
.CD 



o 

G CN 
COO 



CN 
O 



CO 

pb 
o 
o 



CN 



ViO 
ON 

CO 
CO 



CO 

o 
o 

o- 



T3 



co 
C 
c3 



O 



0^ 

co 
c oo 

CO ' — • 



CN 

o 



o 
-a 

co 



o 

X 
P- 



oo 

CN 

CN 
CO 



CN 



o 

CO 
C3 



CO 
vo 
CN 

CN 
VO 
CN 

CO 



CN 



o 

CO 
c3 



O 

Q 



CN 



O 

-a 

co 

c3 



CO CO 



a 

P- 
oo 



H; 
o 

a 

w 

CO 
CO 

CO 

co 



o 

CO 
1— 
=3 

I 
o 

a 

CN 
QJ Cl 



co 



pa 
o 



CO 

o 

CN 



o 
co^ 

=3 

"S 
fc 
o 

a 

o 

c 
B 
o 



-a 

o 
u 

CO 

O 
CO 



CO 



CO 



c 

O' 



o 
a, 

co 
O 

sz 
Cu 

Ut 

O 

4— I 

s 

cr 
<u 

<D 
G 
<D 
CO 



o 

CO'— • 



CN 



o 
-a 
co 

Uh 

2 

1 

o 

G 

3 
o 

u . 
G- 

<o 

C 

o3 



CO 



G 

S 
o 

<o 
o 

03 



o 

CN 
CO 

o 

oo 

oi 
co 
uo 
O 

r- 
co 
ui 
cL| 

/V 



o 
p^ 



a-O 



CN 

3- 

vo 

ON 
CO 

CO 



p-1 
o 



CO 
ON 



=3 



O 
CO 

Uh 

3 

o3 

c 

O 

5 

o 
04 

(U 
G 
03 



O 

co|-^ 

"G 

s 



CO 



a3 
c 

G 

o3 

x: 

-a 
o 

"03 
CO 

<D 

-a 
o 
o 

G 
C 

13 
o 
Co 

ON 

o 
o 
o 



CN 
CO 

e 

a. 
c 

CO 



03 
03 

CN 




236 





2.30E-14 


2.10E-13 


7.30E-12 


2.00E-150 


2.30E-228 


4.00E-88 


9.10E-86 


4.30E-84 


4.70E-79 


6.90E-69 


8.20E-66 


2.80E-65 


8.10E-62 


1.90E-39 


2.80E-39 


3.90E-39 


. 2,50E-36 


L10E-34 


9.00E-33 


5.70E-145 


1.20E-47 


3.90E-138 


4.00E-47 


9.90E-284 


1.60E-145 




vo 


o 

T— t 


vo 


' 1096 


1637 


VO 
VO 
CN 


vo 

o 

CO 


CN 
CN 


CN 
CN 
CN 


lO 
CN 
CN 


VO 

CO 
CN 


vo 

as 


r- 

CO 


VO 


OO 


vo- 
CN 
CN 


r- 
r» 


m 


CO 


1060 


VO 
CN 


1022 


T— 1 

vo 

CN 


. 2007 


1087 




ORF 73, contains large complex repeat CR 73 [Kaposi's 


E02A10.2 [Caenorhabditis elegans] 


glutamic acid-rich protein [Plasmodium falciparum] 


(AEOOl 146) B. burgdorferi predicted coding region BB0409 [Borrelia 


(AEOOl 146) PTS system, fructose-specific IIABC component (fruA-1) 


similar to fructose phosphotransferase system enzyme II . 
* 


similai* to phosphotransferase system enzyme II [Escherichia coli] 


fructose-specific IIBC component [Escherichia coli] >gil450372 


fructose enzyme II [Rhodobacter capsulatus] >gil46021 fructose. ' 


fructose-permease IIBC component (fruA) [Haemophilus influenzae] 


(AEOOl 164) PTS system, fructose-specific IIABC component (fruA-2) 


phosphotransferase system (PTS) fructose-specific enzyme IIBC 


PTS enzyme-II fructose [Xanthomonas campestris] >pir!B40944IB40944 


similar to fructose-specific phosphotransferase enzyme II 


HrsA [Escherichia coli] >gill786951 (AE000176). 


phosphotransferase enzyme II [Bacillus firmus] 


fruA gene product [Bacillus amyloliquefaciens] >pirlS59965IS59965 


PTS SYSTEM, FRUCTOSE-SPECIFIC IIBC COMPONENT (EIIBC- 
FRU) 


(AEOOOOlO) Mycoplasma pneumoniae, fructose-permease IIBC 
component; 


|(AE001 146) B. burgdorferi predicted coding region BB0406 [Bonelia 


|(AE001146) B. burgdorferi predicted coding region BB0405 [Bonelia 


|(AE001146) B. burgdorferi predicted coding region BB0405 [Borrelia 


|(AE001146) B. burgdorferi predicted coding region BB0406 [Borrelia 


|(AE001 145) glutamate transporter, putative [Borrelia burgdorferi] 


(AEOOl 145) B. burgdorferi predicted coding region BB0399 [Borrelia 
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glycerol dfiffusion facilitator [Pseudomonas aeruginosa] 


(AE001005) glycerol uptake facilitator, MIP channel (glpF) 


(AEOOl 134) B. burgdorferi predicted coding region BB0238 [Borrelia 


(AEOOl 133) B. burgdorferi predicted coding region BB0236 [Borrelia 


(AEOOl 132) gufA protein [Borrelia burgdorferi] >pirlC70127IC70127 


coded for by C. elegans cDNA CEESS55F; coded for by C. elegans 
cDNA 


(AE000831) conserved protein [Methanobacterium thermoautotrophicum] 


gufA gene product [Myxococcus xanthus] >gil49253 orfX gene 


j(AE000757) hypothetical protein [Aquifex aeolicus] 


(AEOO 11 32) phosphate ABC transporter, periplasmic phosphate-binding 


(AE000929) phosphate-binding protein PstS [Methanobacterium 
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SPINDLY [Arabidopsis thaliana] 


|(AE000762) hypothetical protein [Aquifex aeolicus] 


|(AF056198) Hsp70/Hsp90 organizing protein homolog [Drosophila 
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vacuolar ATPase B subunit [Hordeum vulgare] 
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TABLE 3. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucme 




Valine 


■ c 

Polar 


Glutamine 




Asparagine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 




261 



TABLE 4. Residues Comprising Epito-Bearing Fragments 



Query 


Residues Comprising Epito-Bearing Fragments 


flOl.aa 


from about Lys-62 to about Gly-64, from about Ser-1 1 1 to about 
Asp- 1 13," from about Arg-136 to about Arg-139, from about Pro- 
189 to about Asn-193. 


fll.aa 


from about Pro-38 to about Lys-40, from about Glu-92 to about 
Lys-96. 


fl2.aa 


from about Pro-288 to about Asp-290, from about Asn-336 to 
about Gly-338, from about Tyr-410 to about Gly-413, from about 
Asp-418 to about Arg-420, from about Pro-552 to about Val-555, 
from 




about Gln-643 to about Asp-645, from about Gin- 1061 to about 
Arg-1063, from about Asn-1130 to about Lys-1 132. 


fl29.aa 


from about Glu-76 to about Arg-81, from about Lys-144 to about. 
Asn-146. 


fl47.aa 


from about Gln-94 to about Thr-96. 


fl52.aa 


from about Gly-35 to. about Gly-37, from about Gln-321 to about 
Gly-323. 


fl54.aa 


from about Asn-39 to about Lys-41, from about Ser-74 to about 
Lys-77, from about Ser-213 to about Gly-215, from about Ser-303 
to about Asp-306, from about Asp-422 to about Asn-424. 


fl57.aa 


from about Lys-21 to about Asp-24,. from about Ser-45 to about 
Tyr-47. 


fl7.aa 


from about Arg-17 to about Asn-20, from about Thr-94 to about 
Gly-96. 


fl86.aa 


from about Lys-305 to about Tyr-308. 


fl96.aa 


from about Lys-121 to about Asn-123, from about Pro-278 to about 
Lys-282, from about Glu-576 to about Tyr-578. 


f899.aa 


from about Asn-174 to about Asp- 177. 


f925.aa 


from about Lys-201 to about Asp-204, from about Phe-291 to 
about Lys-294. 


f929.aa 


from about Pro- 139 to about Asn-141, from about Arg-21 1 to about 
Glu-214, from about Thr-370 to about Asn-375. 


f933.aa 


from about Ser-1 39 to about Lys-143. 


f940.aa 


from about Gly-143 to about Asn-148. 


f943.aa 


from about Asp-58 to about Asp-60, from about Lys-1 57 to about 
Asn-159, from about Asp-217 to about Asp-221, from about Lys- 
250 to about Asn-254, from about Pro-262 to about Asn-264, from 
about Gly-305 to about Trp-307. 


f952.aa 


from about Ser-52 to about Ser-54. 


f4.aa 


from about Arg-64 to about Arg-67. 


f43.aa 


from about Ser-84 to about Gln-87, from about Asp-231 to about 
Tyr-233, from about Arg-296 to about Asp-300. 


f50.aa- 


from about Glu-136 to about Gly-138, from about Asp- 153 to . 
about Lys-155, from about Asp-289 to about Asp-291, from about 
Glu-458 to about Asn-461 . 


f65.aa 


from about Glu-120 to about Asp- 122 /from about Pro-204 to about 
Tyr-206. 


fS.aa 


from about Pro-263 to about Arg-265, from about Asb-274 to about 
Lys-278. ■ * 


f82.aa 


from about Tyr-66 to about Giy-68, from about Ser-1 16 to about 
Lys-1 19, from about Asp- 121 to about Gly-123, from about Pro- 
128 to about Gly-131. 
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TABLE 4. Residues Comprising Epi to-Bearing Fragments 



f86.aa 


from about Asn-179 to about Asn-181, from about Lys-192 to 
about Asn-194, from about Lys-270 to about Asn-272, from about 
Lys-279 to about Lys-282, from about Asp-331 to about Asn-333. 


f477.aa 


from about Pro-250 to about Lys-253. 


f488.aa 


from about Lys-76 to about Lys-79, from about Asn-486 to about 
Asp-489, from about Lys-508 to about Gly-510, from about. Asn- 
559 to about Gly-562. 


f494.aa 


from about Lys-76 to about Asn-78. 


f5l6.aa 


from about Lys-32 to. about Asp-34. 


f523.aa 


from about Pro-202 to about Asn-206, from about Lys-255 to about 
Tyr-258. ~ 


f526.aa 


from about Asn-85 to about Lys-88, from about Asp- 136 to about 
Gly-138. ' 


f577.aa 1 


from about Cys-18 to about Lys-22, from about Asn-297 to about 
Gln-300. 


f584.aa 


from about Pro-131 to about Lys-133, from about Pro-200 to about 
Ser-202. 


f596.aa 


from about Arg-42 to about Asp-44, from about Asp- 1 17 to about 
Tyr- 119, from about Pro-205 to about As£-207. 


f600.aa 


from about Pro- 1 43 to about Asp- 1 45 . 


f603.aa 


from about Phe-35 to about Ser-37. 


f607.aa 


from about Gln-67 to about Lys-70, from about Asp-273 to about 
Tyr-275, from about Asp-333 to about Gly-338, from about Pro- 
359 to about Lys-362, from about Arg-409 to about Gly-411. 


f611.aa 


from about Arg-133 to about Gly-135. 


f631.aa 


from about Pro-132 to about Asn-136, from about Asn-159 to 
about Tyr- 161, from about Pro-216 to about Asp-218, from about 
Pro-220 to about Lys-223. 


f688.aa 


from about Lys-266 to about Asp-268, from about Lys-271 to 
about Asn-273, from about Lys-315 to about Lys-318. 


f704.aa 


from about Lys-250 to about Lys-253. 


f707.aa 


from about Lys-131 to about Asp- 134, from about Asp-246 to 
about Asn-249. 


f709.aa 


from about Tyr-39 to about Gly-42, from about Lys-148 to about 
Gly-150, from about Arg-269 to about Gly-272, from about Ser- . 
466 to about Tyr-468, from about Asn-489 to about Asn-491, from 




about Lys-575 to about Asp-578, from about Pro-642 to about Lys- 
644. 


fl97.aa. 


from about Pro-217 to about Asp-219, from about Glu-675 to about 
Asp-678, from about Pro-687 to about Asn-689, from about Glu- 
694 to about Gln-696. 


f200.aa 


from about Arg-174 to about Phe-179. 


f208.aa 


from about Arg-326 to about Ser-328. 


f210.aa 


from about Pro-191 to about De-194. 


f221.aa 


from about Asn-133 to about Asn-135. 


f253.aa 


from about Arg-191 to about Gly-194. 


f269.aa 


from about Ser-27 1 to about Thr-273, from about Asp-284 to about 
Gly-286. 


f29.aa 


from about Pro- 159 to about Ser- 161. 


f290.aa 


from about Pro-240 to about Gly-244. 


f291.aa 


. from about Gln-267 to about Lys-269. 
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TABLE 4. Residues Comprising Epito-Bearing Fragments 





f296.aa 


from about Glu-98 to about Lys-101. 




f3.aa 


from about Asn-241 to about Lys-245. 




OO.aa 


from about Asn-156 to about Tyr-159, from about Asn- 178 to 
about Lys- 180. 




f939.aa 


from about Ser-245 to about Asn-249. 




f739.aa 


from about Asn-80 to about Tyr-82, from about Lys-208 to about 
Ser-210. 




f742 : aa 


from about Ser-141 to about Asp- 145, from about Asn-222 to about 
Gln-225, from about Asp-243 to about Tyr-247, from about Asn-. 
249 to about Asn-25 1 . - 




f743.aa 


from about Arg-1 1 1 to about Gly-1 14, from about Pro- 131 to about 
Asp- 134. 




f790.aa 


from about Thr-40 to about Asn-42, from about Ser-53 to about 
Ser-55, from about'Lys-215 to about Asp-218, from about Asn-274 
to about Gly-277. 


m 


f792.aa 


from about Val-82 to about Ser-84, from about Ser-102 to about 
Asn- 104, from about Gln-127 to about Tyr- 130, from about Lys- 
309 to about Asn-314, from about Lys-375 to about Thr-377, from 


Ul ■ " 
a 




about Pro-5 1 1 to about His-5 13, from about Thr-5 15 to about Asp- 
517. 


fil ■ 
y 


f797.aa 


from about Pro-1 19 to about Gly-122, from about Lys- 166 to about 
Asn- 169. * 


a 


f799.aa 


from about Asn-3 1 to about Asn-34, from about Gln-44 to about 
Asn-47, from about Pro- 123 to about Gly-125. 




f814.aa 


from about Ser-120 to about Ser-122, from about Arg-636 to about 
Asn-638, from about Cys-967 to about Ser-969. 


fll 


f820.aa 


from about Thr-563 to about Tyr-565. 


O 


f850.aa - 


from about Tyr-159 to about Tyr-164, from about Gln-375 to about 
Asp-379. 




f853.aa 


from about Thr-180 to about Lys- 184, from about Arg-231 to about 
Asp-233, from about Asn-252 to. about Gly-254. 




f859.aa 


from about Lys-46 to about Ser-52, from about Pro-88 to about 
Asn-91^ from about Asn- 1 17 to about Asp- 120. 




f86Laa 


from about Asp-38 to about Lys-40, from about Lys-219 to about 
Asn-225. 




f368.aa 


from about Gln-228 to about Asn-23 1 . 




f37Laa 


from about Tyr- 109 to about Asn-1 1 1 , from about Asn- 162 to 
about-Gln-164. 




f502.aa 


from about Asn- 1 18 to about Lys- 122, from about Ser-269 to about 
Gly-271, from about Lys-370 to about Asp-373, from about Asn- 
509 to about Lys-51 1, from about Lys-705 to about Arg-707, from 






about Thr-912 to about Gly-914, from about Pro- 12 13 to about 
Asp- 12 16, from about Asn- 1491 to about Arg-1493. 




1527. aa 


from about Cys-20 to about Gln-22, from about Asn-38 to about 
Asn-40, from about Phe-1 12 to about Asp-1 14, from about Lys- 160 
to about Asn- 162, from about Ser-199 to about Asp-201, from 
about 






Gln-258 to about Gly-261, from about .^rg-282 to about Asn-284, 
from about Ser-297 to about Asp-299. 




f541.aa 


from about Ser-68 to about Asn-7 1 . 




f604.aa 


from about Lys-77 to about Gly-79, from about Lys-201 to about 
Asn-203, from about Asp-252 to about Asp-254, from about Tyr- 
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TABLE 4. Residues Comprising Epito-Bearing Fragments 





/ co aoouc v_riy-Jjw, irom douut nap-ji't to aoout lrp-jio. 


r / jo.aa 


irom aoouc i^ys-zu co apouc .rvbn-zM-, irom aoouc /\rg-i i +/ co aoouc 
Ser-153, from about Ser-231 to about Lys-233. 


1 / 52. aa 


irom aoouc inr-i cy to aoouc .uys- xzz, irom aoouc rTO-4zu co aoouc 
vjiy-^+zz. _ 


f798.aa 


from about Asp-33 to about Thr-36, from about Lys-180 to about 


f635.aa 


from about Pro- 100 to about Asn-102, from about Asp- 145 to 

oUniit Php 1 /17 

clDOUt rne-i i f/. 


f32.aa 


from about Lys-18 to about Asn-20. 


t32u.aa 


irom about Asn-iyo to aoouc Leu-ivo, rrom aoouc oxn-z4o to 
about Lys-250. 


f3.D2.aa 


xrom about oer-4o to about Asn-4V. 


f301.aa 


from about Lys-178 to about Lys-180, from about Ser-401 to about 
Tyr-404. 


f373.aa 


from about Gly-88 to about Lys-90, from about Asn-539 to about 
jLys-j4Z, xrom aoouc ljiu-oj<+ co aoouc oer-oj /. 


f384.aa . • 


from about Pro-250 to about Asn-252, from about Asp-266 to 
about Lys-zoo. 


f446.aa 


from about Asp-20 to about Ser-26, from about Asn-146 to about 
i^ys-i4v. 


ij4z.aa 


irom aoouc Arg-oo co aoouc oiy-oo, irom aoouc /\rg-xo~> 10 douui 

A cn IAS 


ryj.aa 


XiVJill O.UVJUL iVoil 1 Ji. LvJ aUvUL r\jjj* UJ. 


f i ns q o 


irom aoouc r\op ~h-o lu auuuL nic-ju. 


f 1 SA act 


frr*m ahnnt Xhr-9 to ahnnl" A 1 R from about A«;r)-9^6 fo 
irum d.UULlL 1 111 ^ 1*+ LvJ cLUVJLll rVojJ ^.lO, 11 Will auuui rvop lkj 

ohnnt A^n-2SQ 


f219.aa 


from about Asn-77 to about Asn-81, from about Asp-1 1 1 to about 

A en- 1 1 5 


f229.aa 


from about Gln-61 to about Asn-63. 


f32.aa 


from about Lys-18 to about Asn-20. 


rloo.aa 


irom about JLys-JLo to about lyr-JUo. 


f216.aa 


from about Ser-105 to about Asn-107. 


f328.aa 


from about Asn-105 to about Asp- 107. 


f352.aa 


from about Ser-46 to about Asn-49. 


t'867.aa 


from about Thr-3 to about Gly-5, from about Lys-156 to about Ser- 
159. 


f868.aa 


from about Arg-94 to about Gly-96, from about Pro-257 to about 
Gly-26L from about Pro-295 to about Asp-297, irom about Arg- 
340 to about Asp-342. - 


f872.aa 


from about Ser-19 to about Lys-2j, xrom about lnr-ljy to aoouc 
Asp-14z, irom about oer-zoz to aoouc lyr-zoo, irom aoouc ocr-jn 
tn about Ser-313 


f886.aa 


from about Thr-83 to about Asp-85, from about Asp- 106 to about 
Lys-108, from about Lys-143 to about Gly-147, from about Asp- 
186 to about Asn-191. 


f888.aa 


from about Asn-65 to about Asp-67. . 


f893.aa 


from about Asn-203 to abouc Asn-207, from about Thr-446 to 
about Asn-450. 


f605.aa 


from about Arg-3 1 to about Asp-33. 


f606.aa 


from about Asn-68 to about Gly-71, from about Asn-136 to about 
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Lys- 139, rrom about Asn-223 to about lyr-226, from about oer- 
276 to about Tyr-279, from about Pro-362 to about Asn-365, from 




about Arg-503 to about Trp-507. 


f679.aa 


from about Lys- 154 to about Asp- loo, from about Lys-224 to 
about Arg-226, from about Asn-260 to about Asp-264, from about 
Glu-3o3 to about Lys-3oo,,rrom about Asp-3o7 to about Gly-389, 
from 




about Tyr-441 to about Lys-443, from about Axg-501 to about Tyr- 
DU4. 


fll-I2.aa 


from about Pro-91 to about Asn-93, from about Pro-181 to about 
Asp-loo, irom aoout .Lys-z^^- to aoout ber-z4o. 


fll-4.aa 


from about Asn- 1 60 to about Lys- 163. 


f 14-8. aa 


from about Pro-92 to about Gln-95, from about Lys- 123 to about 
Inr-lzD, rrom aoout i_,ys-zi D to aoout Asp-ziy. 


fl7-6.aa 


from about Pro-36 to about Glu-38. 


il9-2.aa 


from about Ser-104 to about Ser-106, from about Gln-230 to about 
Asn-232. 


rl9-4.aa 


from about Val-79 to about Thr-82, from about Pro- 195 to about 
Gly-201. 


fl9-6.aa 


from about Asp-24 to about Lys-30, from about Pro-36 to about 
Glu-38. 


i21-4.aa 


from about Cys-24 to about Asn-26. 


i28-2.aa 


rrom about oer-77 to about Lys-ol), rrom about 1 yr-274 to about . 
Asn-277. 


r28-3.aa 


from about Glu-53 to about Arg-57, from about Gln-82 to about 
Asn-85, from about Gin- 157 to about Asn- 159. 


f31-2.aa 


from about Arg-95 to about Arg-97, from about Asn-297 to about 
Asn-2y9. 


•fVt 1 C n A 

14-lo.aa 


rrom about rro-loz to about Asp-lo4, rrom about Lys-22U to about 
Asp-222. 


f4-50.aa 


from about Thr- 109 to about Asn- 111. 


f42-l.aa 


from about Asn-55 to about Asn-57, from about Arg-81 to about 
Ser-84, from about Asp-94 to about Asn-97. 


f45-2.aa 


from about Asn-83 to about Gly-86. 


f47-2.aa 


from about Ser-29 to about Asp-33, from about Asn-94 to about 
Lys-99, from about Pro- 152 to about Lys- 157. * 


CA ^ n « A 

i49-2.aa 


from about Asn-452 to about Gly-454. 


f5-14.aa 


from about Glu-102 to about Asp- 106, from about Thr-272 to about 
Asn-275, from about Glu-313 to about Asn-315, from about Ser- 
370 to about oer-372. 


PC 1 C 

id-Id. aa 


rrom about Lys- 170 to about Cjly-l 13, rrom about Asn-iy4 to 
about Gly- 196. 


tM-2.aa 


irom about Asp-302 to about Lys- j04. 


lo-zi.aa 


irom aoout oiu-jo to aoout Asn-4-z, irom aoout Arg-on to aoout 
Gly-87. 


f6-27.aa 


from about Asp-67 to about Asn-69, from about Arg-85 to about 
Asn-89", from about Lys- 168 to about Gly- 171, from about Lys- 179 
to about Asn-181, from about Ser-380 to about His-382. 


f6-5.aa 


from about Ser-67 to about Asn-71. 


f7-30.aa 


from about Pro-94 to about Asp-96, from about Lys- 144 to about 
Arg-147. 


f76-l.aa 


from about Asn-30 to about Lys-35, from about Lys-1 13 to about 
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Gly-116, from about Glu-1 19 to about Lys-121. 


f8-10.aa 


from about Pro-25 to about Lys-32, from about Ser-168 to about 
Thr-172. 


f01a.aa_bb001 


from about Pro-123 to about Asp-125, from about Ser-179 to about 
Asp-181, from about Lys-255 to about Gly-259. 


_bb0011 


from about Ala8 about Arg 17, from about Tyr3 1 to about Gly40, 
from about Ser65 to about Lys78, from about Val93 to about 
Aspl02, from about Serl20 to about Ilel29, from about Prol56 to 
about Glul70, from about Lysl87 to about Asn 196, from about 
His205 to about Lys214, from about Gly226 to about Glu235, fro 
about Gln253 to about Asn266, from about Glu283 to about 
GIu293, from about Leu31 1 to about Ile320, from about Arg326 to 
about Gly335, from about Pro340 to about Ala349 


f02a.aa_bb002 


from about Tyr-169 to about Asn- 171, from about Tyr-242 to about 
Asn-245, from about Lys-264 to about Asp-267. 


_bb9 . 


from about Met7 to about Lysl 6, from about Lys47 to about Ser57, 
from about AsnSO to about Ser89, from about Gly 103 to about 
Glul 13, from about Lysl25 to about Pro 133, from about Lysl38 
to about Ala 147 


f03a.aa_bb006 


from about Asp-54 to about Thr-57, from about Lys-201 to about 
His-204. 


_bb014 


from about Pro23 to about Gln3 1, from about Ser37 to about 
Asp45, from about Leu76 to about Asn84, from about Leu76 to 
about Val84, from about Ser89 to about Asn97, from about Serl05 
to about Lysl 13, from about Asn 120 to about Met 128, from about 
Asnl59 to about Gly 167, from about Lysl73 to about Ball81 


_bb023 


from about Asp 17 to about Gly27, from about Arg40 to about 
Asp48, from about Val64 to about Asp72, from about Glul 05 to 
about Thrl 13, from about SerHl to about Gly 150, from about 
Asp 155 to about lie 163, from about Asn 184 to about Lysl 98, from 
about Ile219 to about Pro227, from about Ser230 to about Phe238, 
from about Ser241 to about Asn250, from about Asp270 to about 
Val278 y from about Ser285 to about Leu293, from about Glyu307 
to about Ser315, from about Lys327 to about Asn335 


f08a.aa_bb024 


from about Asn-30 to about Asp-33, from about Ser-1 16 to about 
Asn-1 18, from about Asn-154 to about Gly-156. 


f09a.aa_bb025 


from about Asn-30 to about Ser-35, from about Thr-145 to about 
Asn-148. 
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Applicant's or agent's rlie PB3j 
reference number 



"j^J^TZ International appIicaTioTi^J^Unassigned 



INDICATIONS RELATING TO A DEPOSITED iMICROORGANISM 

(PCT Rule 13 bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 8 , line 8 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet Q 



Name of depositary 'institution , 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit August 8, 1998 



Accession Number 202012 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 



For receiving Office use only ' 



□ 



This sheet was received with the international application 



Authorized officer 



□ 



For International Bureau use only ' 
This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



